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Abstract.  This  tutorial  presents  a  theory  of  valid  inequalities  for  mixed  integer  linear  sets. 
It  introduces  the  necessary  tools  from  polyhedral  theory  and  gives  a  geometric  understanding 
of  several  classical  families  of  valid  inequalities  such  as  lift-and-project  cuts,  Gomory  mixed 
integer  cuts,  mixed  integer  rounding  cuts,  split  cuts  and  intersection  cuts,  and  it  reveals  the 
relationships  between  these  families.  The  tutorial  also  discusses  computational  aspects  of  gen¬ 
erating  the  cuts  and  their  strength. 
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1.  Introduction 

In  this  tutorial  we  consider  mixed  integer  linear  programs.  These  are  problems 
of  the  form 

max  cx  +  hy 

Ax  +  Gy  <  b 

x  >  0  integral 
V  >  0, 

where  the  data  are  the  row  vectors  c  £  Q",  h  £  Qp,  the  matrices  A  £  (Qmxn, 
G  £  Qmxp  and  the  column  vector  b  £  Qm;  and  the  variables  are  the  column 
vectors  x  £  R"  and  y  £  Rp.  The  set  S  of  feasible  solutions  to  this  mixed  integer 
linear  program  is  called  a  mixed  integer  linear  set.  That  is,  if  P  :=  {(x,y)  € 
R"  x  R+  :  Ax  +  Gy  <  6},  then  S  :=  P  fl  (Z"  x  R(j_).  It  can  be  shown  that 
conv(S')  is  a  polyhedron  (see  for  example  Nemhauser  and  Wolsey  [46]).  In  order 
to  solve  a  mixed  integer  linear  program,  it  is  therefore  equivalent  to  solve  the 
linear  program 
max  cx  +  hy 

( x ,  y)  €  conv(S') 

since  the  extreme  points  of  conv(S')  are  in  S.  This  is  the  polyhedral  approach. 
The  difficulty,  of  course,  is  to  construct  inequalities  defining  conv(5')  given  those 
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defining  P.  This  tutorial  presents  approaches  for  constructing  improved  approx¬ 
imations  of  conv(S')  recursively.  An  inequality  is  said  to  be  valid  for  a  set  if  it  is 
satisfied  by  every  point  in  this  set.  A  cut  with  respect  to  a  point  (x,  y)  £  conv(S') 
is  a  valid  inequality  for  conv(S')  that  is  violated  by  (x.  y).  The  polyhedral  ap¬ 
proach  to  solving  mixed  integer  linear  programs  leads  naturally  to  the  cutting 
plane  approach:  Solve  the  linear  programming  relaxation  obtained  by  ignoring 
the  integrality  requirements  on  x:  if  this  relaxation  is  unbounded  or  infeasible, 
then  stop:  the  mixed  integer  linear  progam  is  infeasible  or  unbounded;  otherwise 
let  (x.  y)  be  an  optimal  extreme  point  solution;  if  (x,  y)  £  S,  then  stop:  (x,  y) 
is  an  optimal  solution  of  the  mixed  integer  linear  program;  otherwise  generate  a 
cut  with  respect  to  (x,  y) ,  add  it  to  the  previous  linear  programming  relaxation, 
solve  this  improved  relaxation,  and  repeat.  In  this  tutorial,  we  give  a  geometric 
understanding  of  several  classical  families  of  cuts  such  as  split  cuts  [25],  Go- 
mory  mixed  integer  cuts  [34],  intersection  cuts  [4]  and,  in  the  mixed  0,1  case, 
lift-and-project  cuts  [54],  [43],  [7].  We  study  the  relationships  between  these  var¬ 
ious  families  and  discuss  computational  aspects  of  these  cuts.  Cuts  that  exploit 
specific  structures  of  the  constraints  are  also  the  object  of  a  vast  literature  but 
they  are  not  addressed  in  this  tutorial.  We  start  with  some  useful  results  in 
polyhedral  theory. 


2.  Polyhedra 

A  polyhedron  in  R”  is  a  set  of  the  form  P  :=  {x  £  R”  :  Ax  <  b}  where  A  is  a 
real  matrix  and  b  a  real  vector.  If  A  and  b  have  rational  entries,  P  is  a  rational 
polyhedron.  The  polyhedron  {x  £  R"  :  Ax  <  0}  is  called  a  polyhedral  cone.  Note 
that  a  polyhedral  cone  is  always  nonempty  since  it  contains  the  null  vector  0. 

For  S  C  R™,  the  convex  hid l  of  S  is  the  set  conv(S')  :=  {a;  G  R"  :  x  = 
J2i=1  where  k  >  1,  A  £  R+ ,  J2i=i  =  1  and  x1,.. xk  £  S'}.  This  is  the 

smallest  convex  set  that  contains  S.  It  will  sometimes  be  useful  to  work  with 
conv(S),  the  closure  of  conv(S),  which  is  the  smallest  closed  convex  set  that 
contains  S.  The  conic  hull  of  a  nonempty  set  S  C  R"  is  cone(S)  :=  {x  £  R"  : 
x  =  i  AjX*  where  k  >  1,  A  £  R+  and  x1, . . . ,  xk  £  S}. 

The  convex  hull  of  a  finite  set  of  points  in  R"  is  called  a  polytope. 

An  important  theorem,  due  to  Minkowski- Weyl,  states  that  every  polyhedron 
P  can  be  written  as  the  sum  of  a  polytope  Q  and  a  polyhedral  cone  C.  Here 
Q  +  C  :=  {.t  €  R"  :  x  =  y  +  z  for  some  y  £  Q  and  2  £  C}.  Note  that 
P  =  0  if  and  only  ifQ  =  0.  IfP  :=  (a:  £  R"  :  Ax  <  b}  is  nonempty,  then 
C  =  [i  £  R"  :  Ax  <  0}  is  unique.  The  cone  {x  €  Rn  :  Ax  <  0}  is  called  the 
recession  cone  of  P. 

Another  useful  result  is  Farkas’s  lemma:  For  a  nonempty  polyhedron  P  := 
{x  :  Ax  <  b},  the  inequality  ax  <  (3  is  valid  for  P  if  and  only  if  there  exists  a 
vector  v  >  0  such  that  vA  =  a  and  vb  <  (3.  [This  is  also  a  consequence  of  LP 
duality:  (3  >  maxjaa:  :  Ax  <  6}  =  min{v6  :  vA  =  a,  v  >  0}.]  Another 
form  of  Farkas’s  lemma  is:  Ax  <  b  has  a  solution  if  and  only  if  vb  >  0  for  all 
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v  >  0  such  that  vA  =  0.  [Equivalently,  by  LP  duality,  max{0  :  Ax  <  b}  = 
min {vb  :  vA  =  0,  v  >  0}.] 

The  proof  of  these  results  can  be  found  in  textbooks  such  as  Schrijver  [53] 
or  Ziegler  [58]. 


2.1.  Some  properties  of  polyhedra  and  convex  sets 
This  section  contains  two  lemmas  that  will  be  used  later. 

Lemma  1.  Let  P  :=  ji  €  R“  :  Ax  <  b}  be  a  polyhedron  and  let  77  :=  P  n  {x  : 
nx  <  7To } .  If  77  ^  0  and  ax  <  P  is  a  valid  inequality  for  77,  then  there  exists  a 
scalar  A  £  R+  such  that 

ax  —  A(nx  —  7To)  <  P 

is  valid  for  P. 

Proof.  By  Farkas’s  lemma,  since  77  0,  there  exist  u  >  0,  A  >  0  such  that 

a  =  uA  +  An 
and  P  >  ub  +  Ang. 

Since  uAx  <  ub  is  valid  for  P,  so  is  uAx  <  p  —  An$ .  Since  uAx  =  ax  —  Anx ,  the 
inequality  ax  —  A(nx  —  no)  <  P  is  valid  for  P.  □ 


Remark  1.  Lemma  1  has  two  possible  outcomes:  When  the  hyperplane  nx  =  no 
is  parallel  to  ax  =  P,  then  by  choosing  A  such  that  a  =  An,  we  get  the  trivial 
inequality  A7To  <  P,  satisfied  by  all  points  in  Rn  (for  instance,  the  lemma  holds 
when  P  =  Rn);  in  the  more  interesting  case  when  nx  =  no  is  not  parallel  to 
ax  =  P,  the  outcome  of  the  lemma  is  a  closed  half-space  containing  P  (see 
Figure  1). 
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VT0)  <  P 


Remark  2.  The  assumption  77  ^  0  is  necessary  in  Lemma  1,  as  shown  by  the 
following  example  (see  Figure  2):  P  :=  {x  €  R2  :  x\  >  0,  x 2  >  0}  and 
II  :=  P  H  {x  :  a?2  <  — 1}-  Thus  77  is  empty.  The  inequality  x\  <  1  is  valid  for 
II  but  there  is  no  scalar  A  such  that  x\  —  \{xi  +  1)  <  1  is  valid  for  P. 


Let  H  C  Rra  be  a  hyperplane  and  S  C  Rn.  In  general,  conv(S')  fl  H  ^ 
conv(S'  n  H)  as  shown  by  the  following  example:  S  consists  of  two  points  not 
in  H  but  the  line  segment  connecting  them  intersects  H.  The  following  lemma 
shows  that  equality  holds  when  S  lies  entirely  in  one  of  the  closed  half  spaces 
defined  by  the  hyperplane  H  (see  Figure  3). 

Lemma  2.  Let  H  :=  {x  £  R"  :  ax  =  6}  be  a  hyperplane  and  S  C  {a;  :  ax  <  b}. 
Then  conv(S)  fl  H  =  conv(S  D  H). 


Proof.  We  prove  first  conv(5  D  H)  C  con v (.S')  fl  H  and  then  conv(S')  fl  H  C 
conv(5  fl  H). 
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Clearly  conv(5  fl  H)  C  conv(S')  and  conv(,S  fl  H)  C  H  so  the  first  inclusion 
is  obvious. 

To  prove  the  other  inclusion,  let  x  £  conv(S')  fi  H.  This  means  ax  =  b  and 
x  =  A*#*  where  x1, . . . ,  xk  €  S,  A  >  0  and  J2i=i  =  1- 

k  k 

b  =  ax  =  A iax1  <  ^  A ib  =  b  (1) 

»= 1  i—1 

where  the  inequality  follows  from  ax 1  <  b  and  A,  >  0.  Relation  (1)  implies  that 
these  inequalities  are  in  fact  equations,  i.e.  ax 1  =  b  for  i  =  1, . . . ,  k.  Therefore 
x1  £  5  fl  H.  This  implies  x  £  conv(5  Pi  H).  □ 


2.2.  Facets 

Let  P  :=  {x  £  M"  :  Ax  <  b}  be  a  polyhedron.  A  face  of  P  is  a  set  of  the  form 

F  :=  P  fl  {x  €  Rn  :  ax  =  /?} 

where  ax  <  /3  is  a  valid  inequality  for  P  (the  inequality  is  said  to  define  the  face 
F).  A  face  is  itself  a  polyhedron  since  it  is  the  intersection  of  the  polyhedron  P 
with  another  polyhedron  (the  hyperplane  ax  =  (3 ).  The  dimension  of  a  set  is  the 
dimension  of  the  smallest  affine  space  that  contains  it.  The  faces  of  dimension 
0  are  called  vertices  of  P,  those  of  dimension  1  are  called  edges  and  those  of 
dimension  dim(P)  —  1  are  called  facets.  The  proof  of  the  following  theorem  can 
be  found  in  Schrijver  [53]. 

Theorem  1.  Let  P  C  Rn  be  a  nonempty  polyhedron. 

(i)  For  each  facet  F  of  P,  at  least  one  of  the  inequalities  defining  F  is  nec¬ 
essary  in  any  description  Ax  <b  of  P. 

(ii)  Inequalities  defining  faces  of  dimension  less  than  dim(P)  —  1  are  not 
needed  in  the  description  of  P  and  can  be  removed. 

This  result  states  that,  if  a  polyhedron  in  ln  has  m  facets,  any  representa¬ 
tion  by  a  system  of  linear  inequalities  in  R"  contains  at  least  m  inequalities.  In 
integer  linear  programming,  we  often  consider  polyhedra  that  are  given  implic¬ 
itly  as  conv(S')  (see  the  Introduction).  It  is  not  unusual  for  such  polyhedra  to 
have  a  number  of  facets  that  is  exponential  in  the  size  of  the  input.  Thus  their 
representation  by  linear  inequalities  in  R"  is  large.  In  some  cases,  there  is  a  way 
to  get  around  this  difficulty:  a  polyhedron  with  a  large  number  of  facets  can 
sometimes  be  obtained  as  the  projection  of  a  polyhedron  with  a  small  number 
of  facets.  For  this  reason,  projections  turn  out  to  be  an  important  topic  in  this 
tutorial. 
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2.3.  Projections 

Let  P  C  R"+p  where  ( x ,  y)  £  P  will  be  interpreted  as  meaning  x  £  R”  and 
y  £  Rp.  The  projection  of  P  onto  the  x-space  R"  is 

proja,(P)  :=  {x  el"  :  By  £  Rp  with  (x,  y)  £  P }. 


y 


projx(P) 


Fig.  4.  Projection 


X 


Theorem  2.  Let  P  :=  {( x,y )  eR"xRp:  Ax  +  Gy  <  b}.  Then  projx{P)  =  {x  £ 
R"  :  v*(b  —  Ax)  >  0  for  all  t  £  T}  where  {t ’*}teT  is  the  set  of  extreme  rays  of 
Q  :=  {v  £  Rm  :  vG  =  0,v  >  0}. 

Proof.  Let  x  £  R”.  By  Farkas’s  Lemma,  Gy  <  b  —  Ax  has  a  solution  y  if  and 
only  if  P(6  —  Ax)  >  0  for  all  extreme  rays  v*  of  vG  =  0,  v  >  0.  □ 

Remark  3.  Variants  of  Theorem  2  can  be  proved  similarly: 

If  V  >  0  in  P,  then  the  relevant  cone  Q  is  :  vG  >0,  v  >  0}. 

If  y  >  0  and  Ax  +  Gy  =  b  in  P,  the  relevant  cone  is  {r>  :  vG  >  0}  with  v 
unrestricted  in  sign. 

Enumerating  the  extreme  rays  of  Q  may  not  be  an  easy  task  in  applications. 
Another  way  of  obtaining  the  projection  of  P  is  to  eliminate  the  variables  yi  one 
at  a  time  (Fourier-Motzkin  elimination  procedure): 

Consider  a  polyhedron  P  C  R"+1  defined  by  the  system  of  inequalities: 

n 

aijXj  +  giZ  <  bi  for  i  £  I.  (2) 

i= i 

Let  1°  =  {i  £  I  :  gi  =  0},  I+  =  {?'  £  I  :  gt  >  0},  I~  =  {i  £  I  :  gi  <  0}. 
The  Fourier-Motzkin  procedure  eliminates  the  variable  z  as  follows:  It  keeps  the 
inequalities  of  (2)  in  /  ,  and  it  combines  each  pair  of  inequalities  i  £  I+  and 
l  £  I~  to  eliminate  z. 

Theorem  3.  The  system  of  |/°|  +  |/+||/“|  inequalities  obtained  by  the  Fourier- 
Motzkin  procedure  is  the  projection  projx(P)  of  P  in  the  x-space  R”. 
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Proof.  (2)  implies 


z  <  —(h  - 
9i 


E 

3= 1 


/  cbijXj)  V*  G  P 


z  >  — (6/  —  '^2  aijxj )  V?  €  I  . 
91  3=1 

Therefore,  every  (x,  z)  in  (2)  satisfies 


1 

9i 


( h  -^aijXj)  < 

3=1 


(bi  ^  '  djjXj). 


9i 


3=1 


(3) 


Conversely,  if  x  satisfies  the  inequalities  (2)  for  i  G  1°  and  the  system  (3)  for 
every  i  G  /+  ,1  G  I~ ,  then 

^  n  i  n 

max — (bi  —  >  auXi )  <  min  — (bi  —  >  a^xf). 
lei-  gi  ^  *e/+  5i  ^ 

j=i  j=i 


Let  2  be  any  value  in  this  interval.  Then  (x,  z)  satisfies  (2). 


□ 


2-4-  Union  of  polyhedra 

In  this  section,  we  prove  a  result  of  B alas  [5],  [6]  about  the  union  of  polyhedra. 
Consider  k  polyhedra  Pi  :=  {x  G  Rra  :  AiX  <  b1}.  We  will  show  that  the  smallest 
closed  convex  set  that  contains  U^=1Pi  is  a  polyhedron.  This  set  is  denoted  by 
conv(U^=1  Pf). 

The  closure  is  needed  as  shown  by  the  following  example:  P\  consists  of  a 
single  point,  and  P2  is  a  line  that  does  not  contain  the  point  P\  (see  Figure  5).  Let 
P3  denote  the  line  going  through  Pi  that  is  parallel  to  P2.  It  is  easy  to  verify  that 
conv(Pi  UP2)  =  conv(P2  U P3)  and  that  conv(Pi  U P2)  =  conv(P2  U P3)  \  (P3  \ Pi) 
(indeed,  a  point  x*  in  P3  \  Pi  is  not  in  conv(Pi  U  P2),  but  there  is  a  sequence 
of  points  xk  G  conv(Pi  U  P2)  that  converges  to  x*).  Here  conv(Pi  U  P2)  is  not  a 
closed  set,  and  therefore  it  is  not  a  polyhedron. 


conv(P!  U  P2) 


P2 


Fig.  5.  conv(Pj  U  P2)  7^  conv(Pi  U  P2) 
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The  following  example  shows  that  conv(U^_1Pj)  can  have  an  exponential 
number  of  facets  in  the  size  of  the  input  Pj,  i  =  1, . . . ,  k. 

Let  el  denote  the  i-tli  unit  vector  in  R™.  Let  Pi  be  the  single  point  e*  for 
i  =  1 ,n,  and  Pn+i  the  single  point  —  el  for  i  =  1, . . . , n.  Then  conv(uf"1Pi) 
has  2n  facets,  namely 

2  n  n 

conv(|^J  Pi)  =  {i  £  R“  :  ^  eiXi  <  1  for  all  e  €  {—1,  +1}"}. 

i—l  i= 1 

This  polyhedron  is  called  the  octahedron. 


Although  convfujL-j  Pt)  may  have  exponentially  many  facets,  Balas  [5],  [6] 
proved  that  it  is  the  projection  of  a  higher  dimensional  polyhedron  Y  with  a 
polynomial  size  representation: 


Y  := 


AiX1  <  blyi 
Yf,  xl  =  x 
J2vi  =  1 

Hi  >  0  for  i  =  1, . . . ,  k. 


(4) 


In  this  formulation,  xl  is  a  vector  in  Rn  and  yi  is  a  scalar,  for  i  =  1, . . . ,  k. 
The  vector  x  €  R"  corresponds  to  the  original  space  onto  which  Y  is  projected. 
Thus,  the  polyhedron  Y  is  defined  in  a  formulation  with  a  polynomial 

number  of  variables  and  constraints  is  said  to  be  compact.  The  gist  of  Balas’s 
result  is  that  conv(U^_1Pj)  has  a  compact  representation.  The  proof  is  given 
below. 

Let  Pi  :=  {x  €  Rn  :  Atx  <  6*}  be  a  polyhedron  and  Ci  :=  {x  €  R"  :  A.tx  < 
0}  its  recession  cone.  By  the  Minkowski- Weyl  theorem,  there  exists  a  polytope 
Qi  such  that  P,  =  Qi  +  Ci. 


Theorem  4.  Consider  k  polyhedra  Pi  :=  {x  €  1”  :  A-iX  <  b1}  and  let  Ci  := 
{x  €  R"  :  AiX  <  0}.  Let  Pi  =:  Qi  +  Ci  where  Qi  is  a  polytope.  //UP*  0, 

assume  that  Cj  C  cone  U i-.p^Q  Ci  for  all  j  such  that  Pj  =  0.  Then  the  following 
sets  are  identical. 
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(i)  cdnv(U^=1Pi) 

(ii)  conv(U^=1Qi)  +  cone(ufL1C'l) 

(in)  projxY,  where  Y  is  defined  in  (f). 

Proof.  Define  Q  :=  convfU^L-j  Qt)  and  C  :=  cone(U^=1C',).  We  will  show  that 

conv(U,f=1Pi)  C  proj^F  CQ  +  CC  conv(U^=1Pi). 

Since  proj^F  is  a  closed  set,  the  first  inclusion  implies  conv(U^=1P;)  C  proj^Y 
and  the  theorem  follows. 

(a)  conv(uf=1Pj)  C  proj^F 

The  result  holds  when  U f=1P;  =  0,  so  we  assume  uf=1P;  ^  0.  Without 
loss  of  generality,  P\, ...  ,Ph  are  nonempty  and  Ph+i,  ■  ■  ■  ,Pk  are  empty,  where 
1  <  h  <  k. 

Let  x  £  conv(ujL1Pi).  Then  a:  is  a  convex  combination  of  a  finite  number  of 
points  in  U^=1  P(.  Since  each  Pj  is  convex,  we  can  write  a;  as  a  convex  combination 
of  points  z%  £  Pi,  say  x  =  Yi=  i  where  y*  >  0  for  i  =  1 ,h  and  Yi=i  V* 

.  Let  xl  =  yi z1  for  *  =  1, . .  h  and  yi  =  0,  x*  =  0  for  i  =  h  +  1, . . . ,  k.  Then 
AiX1  <  blyi  for  i  =  1, . . .  k  and  x  =  Yi=i  x%-  This  shows  that  x  €  proj^.F  and 
therefore  (a)  holds. 

(b)  proja,F  CQ  +  C 

The  result  holds  if  F  =  0,  so  we  assume  F  /  0. 

Let  x  £  proj^F.  By  the  definition  of  projection,  there  exist  x1, . . . ,  a .  ,y  such 
that  x  =  Y)i= i  x 1  where  Ax1  <  blyi,  Y  Vi  =  1,  V  >  0.  Let  /  :=  {i  :  yi  >  0}. 

For  i  £  I,  let  zl  :=  — .  Then  zl  £  Pi.  Since  P;  =  Qi  +  Ci,  we  can  write 
zl  =  wl  +  xl  where  to*  £  Qi  and  xl  £  Ci- 

For  i  ^  J,  we  have  AiX1  <  0.  That  is  xl  £  Ci. 

x  =  ^2  yc1  +  ^2X' 

iel  igl 

k 

=  ^  yiW 1  +  ^  A iXl  where  Xi  >  0 

i£l  i—  1  ^ 

GConv(uQi)  GCone(uCi) 

£  Q  +  C. 

(c)  Q  +  C  C  conv(uk=1  Pf) 

The  result  holds  when  Q  =  0,  so  we  assume  Q  yf  0.  Without  loss  of  generality 
Q i, . .  • ,  Qh  are  nonempty  and  Qh+i,  ■  ■  ■ ,  Qk  are  empty,  where  1  <  h  <  k. 

Let  x  £  Q  +  C.  Then  x  =  Yi=i  ViwX  +  Xu=i  xl  where  wl  £  Qi,  yi  >0  for 
i  =  1, . . . ,  h,  Yi=i  Vi  =  1  and  x1  £  Ci  for  i  =  1, . . .  k.  By  the  assumption  in  the 
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statement  of  the  theorem  we  have,  for  j  =  h  +  1, . . . ,  k,  x J  tAxl *  where 

(/?■  >  0  and  x u  €  Ci  for  i  =  1, . . . ,  h. 

Based  on  the  above  vector  y  €  R^,  define  /:={*:  >  0}  and  consider  the 

point 

u  h  k 

xe  ■=  ^2(yi  -  ijj- e)wl  +  Y  e(V  +  -(x*+  Y  k^x13)) 

iel  '  '  i= 1  j=/i+l 

for  e  >  0  small  enough  so  that  yi  —  jjj-e  >  0  for  all  i  £  I. 

xe  e  conv(uf=1Pj)  since  J2iei(yi  -  ]7|  e)  +  Z)£=i  e  =  1  and  +  f  (z*  + 

EjU+i  Mi*”)  C 

Furthermore  a;e  — >  a;  as  e  — >  0. 

Therefore  x  €  conv(U*=1Pj).  □ 

Remark  4-  The  assumption  in  Theorem  4  is  necessary  as  shown  by  the  following 
example  (see  Figure  7):  Pi  :=  {x  €  R2  :  0  <  x  <  1}  and  P2  :=  {x  G  I2  :  X\  < 
0,Xi  >  1}.  Note  that  P2  =  0  and  C2  =  {1  £  l2  :  X\  =  0}.  The  theorem  does 
not  hold  in  this  case  since  proj^y  =  Pl  +  C2  =  {x  £  R2  :  0  <  x\  <  1},  which  is 
different  from  conv(Pi  U  P2)  =  Pl. 


\c2 

Fig.  7.  conv(Pi  U  P2)  7^  proj^Y 


Remark  5.  The  assumption  in  Theorem  4  is  automatically  satisfied  if 

(i)  Ci  =  {0}  whenever  Pi  =  0,  or 

(ii)  all  the  polyhedra  Pi  have  the  same  recession  cone. 

For  example  (i)  holds  when  all  the  P^s  are  nonempty,  or  when  Ci  =  {0}  for  all  i. 

Remark  6.  If  all  the  polyhedra  P,  have  the  same  recession  cone  C  =  Ci  for 
i  =  1, . . . ,  k,  then  conv(uf=iPj)  =  conv(U^=.1Pj).  Indeed,  in  this  case,  part  (c)  of 
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the  above  proof  simplifies  and  shows  that  Q  +  C  C  conv(Lr=1  P,:).  Furthermore 
U Pi  is  the  x-projection  of 

f  AiX1  <  blyi 

U:=  5X‘  =  f 

I  E  Vi  =  1 

[  y,  £  {0, 1}  for  *  =  1, ,  k. 

To  prove  the  last  claim,  observe  that  if  x  £  U Pi,  then  x  £  Pj  for  some 
j  =  1 ,k.  Setting  xJ  =  x,  yj  =  1  and  xl  =  0,  yi  =  0  for  i  ^  j,  we  have  a 
solution  (XjX1, . . .  ,xk,y )  £  Yj.  Conversely,  given  (x,  x1, . . .  ,xk,y)  £  Yj,  let  j  be 
the  index  for  which  yj  =  1.  We  have  Ajx°  <  b°  and  AjX1  <  0  for  i  ^  j,  namely 
xJ  £  Pj  and  x1  is  in  the  recession  cone  of  Pi  for  i  ^  j.  Since  the  recession  cone 
of  Pi  is  also  the  recession  cone  of  Pj,  the  point  x  =  Ei=i  belongs  to  Pj. 

Remark  1.  When  the  PiS  have  different  recession  cones,  the  x-projection  of  Yj  is 
usually  different  from  UP;  as  shown  by  the  following  example  (see  Figure  5). 

Let  Pi  C  R2  consist  of  the  single  point  (0,1),  and  let  P2  C  R2  be  the  line 
X2  =  0.  Then  we  claim  that  the  x-projection  of  Yj  is  P2  U  P3  where  P3  denotes 
the  line  X2  =  1.  Indeed,  when  yi  =  1  in  Yj,  x  =  x1  +  x2  with  x1  £  P\  and  x2  in 
the  recession  cone  of  P2,  which  is  P2  itself  in  this  example.  So  the  x-projection 
of  Yi  restricted  to  yf,  =  1  is  the  line  P3.  The  x-projection  of  Yj  restricted  to 
?/2  =  1  is  the  line  P2,  which  proves  the  claim. 

In  fact,  under  the  assumption  of  Theorem  4,  we  have 

proj  XY!  =  U  j=1Qi  +  cone(Ui=iCi). 


Application:  Theorem  4  has  been  applied  to  several  basic  models  that  ap¬ 
pear  repeatedly  in  mixed  integer  programming  [37],  [45],  [56],  [3],  [23].  As  an 
example,  we  consider  a  mixed  integer  linear  set  studied  by  Miller  and  Wolsey 
[45]  and  Van  Vyve  [56]  under  the  name  of  continuous  mixing  set.  We  present 
it  here  in  a  form  that  arises  in  the  context  of  robust  mixed  0,1  programming, 
following  Atamtirrk  [3]: 

S  :=  {(x,  y,  z)  £  {0,  l}n  x  R"  x  R+  :  yi  +  z  >  diXi  for  *  =  1, . . . ,  n} 

where  d  £  R"  is  a  given  vector.  The  polyhedron  conv(S')  has  an  exponential 
number  of  facets  but  has  a  compact  formulation  in  a  higher  dimensional  space. 
To  derive  this  formulation,  observe  that  the  variable  2  takes  either  the  value  0 
or  di  for  i  =  1, . . . ,  n  in  every  vertex  of  conv(iS).  Set  do  :=  0.  Let  S  denote  any 
of  these  n  +  1  values,  and  define 

S(6)  :=  {(x,  y,  z)  £  {0, 1}"  x  R"  x  S  :  yi  >  d^Xi  —  5  for  *  =  1, . . . ,  n}. 

Since  the  constraints  of  S(S)  are  decoupled,  one  can  strengthen  each  separately 
to  obtain  the  convex  hull  of  S(S): 

conv(S'(5))  :=  {(x,  y,  z)  £  [0,  l]n  x  R"  x  d  :  yi  >  (di  —  <5)+x;  for  i  —  1, . . . ,  n}. 
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Since  z  €  {c?o,  dn}  for  every  vertex  of  conv(S'),  we  have 

conv(S)  =  conv(U"=0conv(5(d,)))  +  C 

where  C  :=  {( x,y,z )  e  1"  x  1"  x  I  :  x  =  0,  y  >  0,  z  >  0}  is  the  recession 
cone  of  conv(S).  Defining  Pt  :=  con v(S(di))  +  C  (all  P-iS  have  the  same  recession 
cone  C),  it  is  straightforward  to  obtain  the  desired  compact  formulation  Y  as 
defined  in  (4). 


3.  Lift-and-Project 

In  this  section,  we  consider  mixed  0,1  linear  programs.  These  are  mixed  integer 
linear  programs  where  the  integer  variables  are  only  allowed  to  take  the  values 
0  or  1.  It  will  be  convenient  to  write  mixed  0,1  linear  programs  in  the  form 
min  cx 
Ax  >  b 

Xj  €  {0, 1}  for  j  =  1, . . . ,  n 

Xj  >  0  for  j  =  n  +  1, . . . ,  n  +  p, 

where  the  matrix  A  £  Qmx(n+p\  the  row  vector  c  €  Q"+p  and  the  column  vector 
b  £  Qm  are  data,  and  x  £  Rn+P  is  a  column  vector  of  variables. 

Consider  the  polyhedron  P  :=  {x  £  R"+p  :  Ax  >  fo}  and  the  mixed  0,1 
linear  set  S  :=  {x  £  {0, 1}"  x  Rp  :  Ax  >  b}.  The  set  conv(S)  is  a  polyhedron 
and,  ideally,  we  would  like  to  have  its  linear  description  in  the  form  conv(S')  = 
Dx  >  d.  Constructing  the  description  Dx  >  d  would  reduce  the  solution  of 
a  mixed  0,1  linear  program  to  that  of  a  linear  program.  But  this  goal  is  too 
ambitious  in  general  as  the  number  of  inequalities  needed  in  the  description  Dx  > 
d  is  typically  enormous.  This  is  not  surprising  considering  that  mixed  0,1  linear 
programming  is  NP-harcl!  A  more  reasonable  goal  is  to  obtain  an  intermediate 
set  between  P  and  conv(S')  over  which  one  can  optimize  a  linear  function  in 
polynomial  time,  and  then  to  use  recursion  to  get  tighter  approximations  of 
conv(S'). 

Sherali  and  Adams  [54],  Lovasz  and  Schrijver  [43]  and  Balas,  Ceria  and 
Cornuejols  [7]  propose  an  approach  for  doing  this  which  generates  intermedi¬ 
ate  sets  Q  between  P  and  conv(5')  as  projections  of  higher  dimensional  sets 
that  have  a  polynomial  description.  The  polyhedron  P  C  K”+p  is  first  lifted 
into  a  higher  dimensional  space  ffi>Il+P+9  where  the  formulation  is  strengthened. 
This  strengthened  formulation  is  then  projected  back  onto  the  original  space 
Kn+P,  thus  defining  Q.  In  this  process  the  constraints  of  the  higher  dimensional 
formulation  are  defined  explicitly  whereas  those  of  Q  are  only  known  implic¬ 
itly  through  projection,  thus  allowing  Q  to  have  a  nonpolynomial  number  of 
constraints.  This  approach  is  known  under  the  name  of  lift-and-project. 

3.1.  The  lift-and-project  relaxation 

Consider  a  polyhedron  P  :=  {x  £  R"+p  :  Ax  >  b}  and  the  mixed  0,1  set 
S  :=  {x  £  {0,1}"  x  Rp  :  Ax  >  5}.  Without  loss  of  generality,  we  assume 
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that  the  constraints  Ax  >  b  include  Xj  >  0  for  j  =  1, . . .  ,n  +  p,  and  Xj  <  1 
for  j  =  1  Balas,  Ceria  and  Cornuejols  [7]  study  the  following  “lift-and- 

project”  procedure: 

Step  0:  Select  j  G  {1 , . . . ,  n}. 

Step  1:  Generate  the  nonlinear  system  Xj(Ax  —  b)  >  0,  (1  —  Xj)(Ax  —  b)  >  0. 
Step  2:  Linearize  the  system  by  substituting  yi  for  XjXj,  i  ^  j,  and  Xj  for 
x2.  Call  this  polyhedron  Mj. 

Step  3:  Project  Mj  onto  the  x-space.  Let  Pj  be  the  resulting  polyhedron. 

S  C  Pj  follows  from  the  fact  that,  for  any  x  €  S,  we  have  (x,y)  G  Mj  by 
choosing  yi  =  XiXj  for  i  ^  j  since  x 2  =  Xj  (this  holds  because  Xj  is  a  0,1  variable). 
We  also  have  Pj  C  P  since  Ax  >  b  is  obtained  by  adding  the  constraints  defining 
Mj.  How  tight  is  the  relaxation  Pj  of  S  compared  to  the  initial  relaxation  PI 
The  next  theorem  shows  that  it  is  tightest  possible  among  the  relaxations  that 
ignore  the  integrality  of  all  the  variables  Xi  for  i  ^  j. 

Theorem  5.  Pj  =  conv{(Ax  >  b,  Xj  =  0)  U  (Ax  >  b ,  Xj  =  1)} 


Fig.  8.  Illustration  of  Theorem  5 


Proof.  Call  P*  the  set  conv{(7h;  >  b,  Xj  =  0)  U  (Ax  >  b,Xj  =  1)1. 

First  we  show  Pj  CP*. 

We  assume  P  /  0  since  otherwise  the  result  is  trivial. 

If  Pn{xj  =  0}  =  0,  then  P*  =  Pn{xj  =  1}.  We  already  know  that  Pj  C  P. 
Thus,  to  show  that  Pj  C  P*,  it  suffices  to  show  that  Pj  C  { Xj  =  1},  i.e.  that 
Xj  >  1  is  valid  for  Pj.  Let  e  :=  min  {xj  :  x  G  P}.  Since  Pd{xj  =  0}  =  0,  we  have 
e  >  0.  The  inequality  Xj  >  e  is  valid  for  P  and,  by  Farkas’s  lemma,  it  is  implied 
by  a  nonnegative  combination  of  rows  of  Ax  >  b.  Therefore  (1  —Xj)xj  >  (1  —  Xj)e 
is  valid  for  the  nonlinear  system  of  Step  1.  Step  2  replaces  x 2  by  Xj.  This  gives 
that  Xj  >  1  is  valid  for  Pj . 

Similarly  if  P  fl  {xj  =  1}  =  0,  we  get  Pj  C  P*. 

Now  assume  P  H  {xj  =  0}  ^  0  and  P  ft  {x.j  =  1}  ^  0.  Take  ax  >  (3  valid 
for  P* .  Since  it  is  valid  for  II  :=  P  n  {x  :  Xj  <  0}  we  can  find  a  A  such  that 
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ax  +  A Xj  >  (3  is  valid  for  P  (Lemma  1).  Similarly,  we  can  find  /i  such  that 
ax  +  p(  1  —  Xj)  >  (3  is  valid  for  P. 

Therefore  (1  —  Xj)(ax  +  A  Xj  —  P)  >  0  and  Xj{ax  +  /i(  1  —  Xj)  —  P)  >  0  are 
valid  for  the  nonlinear  system  of  Step  1,  and  their  sum  is  too: 

ax  +  (A  +  n)(xj  —  x 2)  —  P  >  0. 

Step  2  replaces  x2j  by  Xj.  This  gives  ax  >  P  valid  for  Mj,  and  thus  for  Pj.  This 
completes  the  proof  that  Pj  C  P* . 

Now  we  show  P*  C  Pj.  We  assume  P*  0  since  otherwise  the  result  is 
trivial.  Let  x  be  a  point  in  P  D  {xj  —  0}  or  in  PH  {xj  =  1}.  Define  yi  =  xiXj  for 
i  ^  j.  Then  ( x,y )  €  Mj  since  x 2  =  Xj.  So,  x  €  Pj.  By  convexity  of  Pj  it  follows 
that  P*  C  Pj.  □ 

The  set  n”=1Pj  is  called  the  lift-and-project  closure.  It  is  a  better  approxi¬ 
mation  of  conv(S')  than  P: 


conv(S)  C  n]=1Pj  C  P. 

How  much  better  is  it  in  practice?  Bonami  and  Minoux  [18]  performed  computa¬ 
tional  experiments  (see  also  Bonami’s  dissertation  [15]).  On  35  mixed  0,1  linear 
programs  from  the  MIPLIB  library  [13],  they  found  that  the  lift-and-project 
closure  reduces  the  integrality  gap  by  37  %  on  average  (the  integrality  gap  is  the 
difference  between  the  objective  value  optimized  over  conv(S')  and  over  P). 


Sherali  and  Adams  [54]  define  a  stronger  relaxation  by  skipping  Step  0  and 
considering  the  nonlinear  constraints  Xj{Ax  —  b)  >  0  and  (1  —  Xj)(Ax  —  b)  >  0 
for  all  j  =  1, . . . ,  n  in  Step  1.  Then,  in  Step  2,  variables  yt]  are  introduced  for  all 
i  =  1, . . . ,  n+p  and  j  =  1, . . . ,  n  with  i  >  j.  Note  that  the  size  of  the  linear  system 
generated  in  Step  2  is  much  larger  than  in  the  previous  lift-and-project  procedure 
(  An-i)  _|_  np  new  variates  and  2 mn  constraints,  instead  of  just  n  +  p  —  1  new 
variables  and  2m  constraints  before).  Let  N  denote  the  resulting  polyhedron 
obtained  by  projection  in  Step  3.  Clearly,  the  Sherali- Adams  relaxation  N  is 
at  least  as  strong  as  the  lift-and-project  closure  defined  above,  and  it  can  be 
strictly  stronger  since  the  Sherali- Adams  procedure  takes  advantage  of  the  fact 
that  tjij  =  yji  whereas  this  is  not  the  case  for  the  lift-and-project  closure  fl”=1Pj. 
How  much  better  is  the  Sherali- Adams  relaxation  in  practice?  We  turn  again  to 
Bonami  and  Minoux  [18].  For  the  19  MIPLIB  intances  for  which  they  could 
compute  the  Sherali-Adams  bound  within  an  hour,  the  improvement  over  the 
lift-and-project  bound  was  10  %  on  average.  Specifically,  on  these  19  instances, 
the  average  integrality  gap  closed  went  from  29  %  for  the  lift-and-project  closure 
to  39  %  for  the  Sherali-Adams  relaxation.  An  even  stronger  relaxation  can  be 
obtained  as  follows: 
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3.2.  The  Lovasz-Schrijver  relaxation 

Step  1:  Generate  the  nonlinear  system  Xj(Ax—b)  >  0  and  (1  —  Xj)(Ax— b)  >  0 
for  all  j  =  1, . . . ,  n. 

Step  2:  Linearize  the  system  by  substituting  for  XiXj,  for  all  i  =  1, ... ,  n+ 
p,  j  =  1 , ...  ,n  such  that  j  <  i,  and  Xj  for  x |  for  all  j  =  1, . . . ,  n.  Denote  by  Y 
the  symmetric  (n  +  1)  x  (n  +  1)  matrix  with  the  vector  (l,xi, . . .  ,xn)  in  row 
0,  in  column  0  and  in  the  diagonal,  and  entry  yij  in  row  i  and  column  j  for 

n(n-l)  | 

i.  j  =  1, . . . ,  n  and  j  <  i.  Call  M+  the  convex  set  in  R+  2  of  all  (x,  y)  that 
satisfy  the  above  linear  inequalities  and  such  that  Y  is  a  positive  semidefinite 
matrix. 

Step  3:  Project  M+  onto  the  x-space.  Call  N+  the  resulting  convex  set. 


In  this  procedure,  the  fact  that  we  get  a  relaxation  N+  instead  of  the  set 


S  itself  is  because  we  replace  the  constraint  Y 


by  “Y  positive 


semidefinite”  in  Step  2. 

We  have  conv(5)  C  N+  C  TV  C  n"_1Pj  C  P. 

The  convex  set  N+  is  not  a  polyhedron  in  general.  Nevertheless,  a  major 
interest  in  the  above  Lovasz-Schrijver  procedure  [43]  is  due  to  the  fact  that  a 
linear  function  can  be  optimized  over  N+  in  polynomial  time.  Indeed,  optimizing 
a  linear  function  over  the  constraint  set  defined  in  Step  2  is  a  semidefinite  pro¬ 
gram :  linear  objective,  linear  constraints  plus  a  symmetric  positive  semidefinite 
constraint  on  the  matrix  Y  of  variables.  Semidefinite  programs  can  be  solved  in 
polynomial  time  by  interior  point  algorithms  [48]. 

How  tight  is  the  Lovasz-Schrijver  relaxation  N+1  From  a  practical  perspec¬ 
tive  the  size  of  the  semidefinite  program  creates  a  tremendous  challenge:  the 
number  of  variables  has  been  multiplied  by  n  and  the  number  of  constraints 
as  well!  Burer  and  Vandenbussche  [19]  solve  it  using  an  augmented  Lagrangian 
method  and  they  report  computational  results  on  three  classes  of  combinatorial 
problems,  namely  the  maximum  stable  set  problem,  the  quadratic  assignment 
problem  and  the  following  problem  of  Erdos  and  Turan:  Calculate  the  maximum 
size  of  a  subet  of  numbers  in  {1, . . . ,  n}  such  that  no  three  numbers  are  in  arith¬ 
metic  progression.  In  all  three  cases,  the  Lovasz-Schrijver  bound  given  by  N+ 
is  substantially  tighter  than  the  Sherali- Adams  bound  given  by  N.  To  illustrate 
this,  we  give  below  the  results  obtained  in  [19]  for  the  size  of  a  maximum  stable 
set  (graphs  with  more  than  100  nodes): 


Name 

nodes  edges 

optimum 

N+ 

N 

brock200-l 

200  5066 

21 

27.9 

66.6 

c-fat200-l 

200  18366 

12 

14.9 

66.6 

johnsonl6-2-4 

120  1680 

8 

10.2 

23.3 

keller4 

171  5100 

11 

15.4 

57.0 

rand-200-05 

200  982 

64 

72.7 

75.1 

rand-200-50 

200  10071 

11 

17.1 

66.6 
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From  a  theoretical  point  of  view,  Goemans  and  Williamson  [33]  proved  a 
striking  result.  They  showed  that,  for  the  max-cut  problem,  the  semidefinite 
relaxation  is  never  more  than  14%  above  the  optimum  value. 

Is  it  possible  to  improve  upon  the  Lovasz-Schrijver  relaxation  in  polynomial 
time?  The  answer  is  yes.  See  for  example  Lasserre  [40],  Laurent  [41],  and  Bi- 
enstock  and  Zuckerberg  [12].  These  relaxations  are  even  more  computationally 
demanding  than  N+.  We  do  not  discuss  them  here.  Instead  we  return  to  relax¬ 
ations  that  can  yield  cutting  planes  in  a  reasonably  short  amount  of  computing 
time. 


3.3.  Lift-and-project  cuts 

In  this  section  we  return  to  the  lift-and-project  relaxation  P3 .  Optimizing  a  linear 
function  over  Pj  amounts  to  solving  a  linear  program.  So  does  the  problem  of 
generating  a  valid  inequality  for  Pj  that  cuts  off  a  given  point  x,  or  showing 
that  none  exists,  as  we  explain  below.  Step  2  of  the  lift-and-project  procedure 
in  Section  3.1  constructs  the  following  polyhedron  Mj. 

Mj  :=  {x  €  R"+p,  y  £  R"+p  :  Ay  —  bxj  >  0,  Ax  +  bxj  —  Ay  >  b,  yj  =  Xj}. 

The  first  two  constraints  come  from  linearizing  the  inequalities  of  Step  1.  Actu¬ 
ally,  the  variable  yj  is  not  introduced  explicitly  in  Step  2.  So,  in  fact,  Mj  is  the 
polyhedron  in  R"+p  x  R"+p  1  obtained  by  identifying  the  variables  yj  =  Xj  in 
the  above  set.  Let  Aj  be  the  m  x  (n+p—  1)  matrix  obtained  from  A  by  removing 
its  j- th  column  a3 .  We  have 

Mj  :=  {x  €  R"+P,y  £  :  Ajy+(a3  —b)xj  >  0,  Ax+(b— aP)xj  —  Ajy  >  6}. 

By  renaming  the  coefficient  matrices  of  x,  we  get 

Mj  =  {x  £  R"+p,  y  £  R[[.+p_1  :  BjX  +  Ajy  >  0,  Ajx  —  Ajy  >  6} 

We  want  to  project  out  the  y  variables.  This  is  done  using  Theorem  2.  The 
appropriate  cone  is 

Q  :=  {(u,  v)  :  uAj  —  vAj  =  0,  u>0,v>0}. 

Namely  the  set  Pj  can  be  written  as: 

Pj  =  {x  £  R"+p  :  (uBj  +  vAj)x  >  vb  for  all  (u,  v )  £  Q}. 

Given  a  fractional  solution  x,  we  can  now  express  that  an  inequality  ax  >  (3  is 
valid  for  Pj  and  cuts  off  x.  Indeed  a  =  uBj  +  vAj  and  (3  =  vb  for  (u,  v)  £  Q  ex¬ 
presses  the  validity  for  Pj  and  ax  <  (3  expresses  that  it  is  a  cut.  To  get  a  deepest 
cut,  one  can  solve  the  following  linear  program,  the  so-called  cut  generating  LP: 

max  vb  —  ( uBj  +  vAj)x 
uAj  —  vAj  =  0 
u  >  0,  v  >  0. 
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These  constraints  along  with  a  normalization  constraint  to  truncate  the  cone 
will  do.  For  example,  we  could  add  the  constraint  X]  ut  +  Ylvi  =  1.  Therefore, 
the  cut  generating  LP  has  2m  variables  and  n  +  p  constraints  other  than  the 
nonnegativity  conditions.  This  is  a  fairly  large  linear  program  to  solve  just  to 
generate  one  cut.  Various  tricks  have  been  used  to  speed  up  the  solution,  such  as 
working  in  the  subspace  of  variables  where  Xj  >  0  for  j  =  1 , ,n+p  and  Xj  <  1 
for  j  =  1 , ...  ,n,  see  [7].  Is  it  possible  to  generate  lift-and-project  cuts  without 
explicitly  formulating  and  solving  the  cut  generating  LP?  Balas  and  Perregaard 
[10]  give  a  positive  answer.  We  explain  this  below. 


Instead  of  expressing  Pj  as  the  projection  of  Mj ,  it  is  also  possible  to  ex¬ 
press  Pj  using  Theorems  5  and  4  and  then  projecting  onto  the  x-space.  This  is 
equivalent  of  course.  By  Theorem  5,  Pj  is  the  convex  hull  of  the  union  of  two 
polyhedra: 

Ax  >  b  Ax  >  b 

x  >  0  and  x  >  0 

— Xj  >0  Xj  >  1. 

In  this  part,  we  assume  that  the  inequalities  Ax  >  b  contain  —  Xj  >  —1  for 
j  =  1, . . . ,  n,  but  not  x  >  0.  By  Theorem  4, 


Pj  =  Pr°ja;  \ 


Ax°  >  byo 

— Xj  >  0 

Ax1  >  by  i 
x)  >  j/i 
x°  +  x1  =  x 


2/o  +  2/i  =  1 
.  X,  X°,  x1, 2/0, 2/1  >  o. 


Let  ej  denote  the  j- th  unit  vector.  Using  the  projection  theorem  (Theorem  2), 
we  get  that  Pj  is  defined  by  the  inequalities  ax  >  (3  such  that 

a  —uA  +uoej  >  0 

a  —vA  —vo ej  >  0 

(3  — ub  <  0 

(3  —vb  —vo  <  0 

U,  Uo,  V,  Vo  >  0. 

Adding  a  normalization  constraint,  we  obtain  the  cut  generating  LP: 


min  ax 
a 
a 


—uA  +uo&j 

>  o 

—vA 

—vo  ej  >  0 

[3  —ub 

<  0 

P 

—vb 

-Vo  <  o 

L*= i  ui 

+u0  +  YJiLl  Vi 

+v0  =  1 

u, 

Uo,  V, 

Vo  >  o 

(5) 
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Balas  and  Perregaard  [10]  give  a  precise  correspondence  between  the  basic 
feasible  solutions  of  (5)  and  the  basic  solutions  (possibly  infeasible)  of  the  usual 
LP  relaxation 

(R)  min{cx  :  Ax>b,x>  0}. 


Fig.  9.  Correspondence  between  basic  solutions  and  lift-and-project  cuts 


A  geometric  view  of  this  correspondence  may  be  helpful:  The  n  +  p  extreme 
rays  emanating  from  a  basic  solution  of  (R)  intersect  the  hyperplanes  Xj  =  0  and 
Xj  =  1  in  n  +  p  points  (some  of  these  points  may  be  at  infinity) .  These  points 
uniquely  determine  a  hyperplane  ax  =  (3  where  (a,  (3)  are  associated  with  a 
basic  feasible  solution  of  the  cut  generating  LP  (5).  For  example,  in  Figure  9, 
cut  1  corresponds  to  the  basic  solution  1  of  (R)  and  cut  2  corresponds  to  the 
basic  (infeasible)  solution  2  of  (R). 

Using  the  correspondence,  Balas  and  Perregaard  show  how  simplex  pivots  in 
the  cut  generating  LP  (5)  can  be  mimicked  by  pivots  in  (R).  The  major  practical 
consequence  is  that  the  cut  generating  LP  (5)  need  not  be  formulated  and  solved 
explicitly.  A  sequence  of  increasingly  deep  lift-and-project  cuts  can  be  computed 
by  pivoting  directly  in  (R).  We  elaborate  on  these  pivoting  rules  in  Section  4.3. 


3A-  Strengthened  lift-and-project  cuts 

Again  we  consider  the  mixed  0,1  linear  set  S  :=  {x  £  {0,  l}nxRJ]_  :  Ax  >  b}.  We 
assume  that  the  constraints  Ax  >  b  contain  —Xj  >  —  1  for  j  =  1, . . .  ,  n,  but  not 
x  >  0.  The  cut  generating  LP  (5)  produces  a  lift-and-project  inequality  ax  >  (3 
that  is  valid  for  Pj.  The  derivation  only  uses  the  integrality  of  variable  Xj,  not 
of  the  variables  Xk  for  k  =  1, . . . ,  n  and  k  ^  j.  Balas  and  Jeroslow  [9]  found  a 
simple  way  to  use  the  integrality  of  the  other  variables  to  strengthen  the  lift-and- 
project  cut.  This  strengthening  has  the  nice  property  that  it  is  straightforward 
to  implement  once  the  cut  generating  LP  (5)  has  been  solved. 
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Note  that,  given  u,Uo,v,Vo,  the  optimal  values  of  and  (3  in  (5)  are: 

_  J  ma x(uak ,vak)  for  k  ^  j 

ak  \  ma x(m3'  —  uq,  va ?  +  no)  for  k  =  j,  '  ' 

where  ak  denotes  the  fc-th  column  of  A ,  and 

/ 3  =  min(it&,  vb  +  uq). 

To  strengthen  the  inequality  ax  >  (3 ,  one  can  try  to  decrease  the  coefficients 
ak-  Balas  and  Jeroslow  [9]  found  a  way  to  do  just  that  by  using  the  integrality 
of  the  variables  Xk  for  k  =  1, . . . ,  n. 

Theorem  6.  (Balas  and  Jeroslow  [9])  Let  x  satisfy  Ax  >  b,  x  >  0.  Given  an 

k  k 

optimal  solution  u,u o,v,Vo  of  the  cut  generating  LP  (5),  define  m.k  =  "“pj™  , 

_  J  min(uor  +  uo  \m.k] ,  vak  —  vq  \rrik\ )  for  k  =  1, . . . ,  n 
ak  (  ma x(uak,vak)  for  k  =  n  +  1, . . .  ,n  +  p 

and  (3  =  min(w&,  vb+  u o).  Then  the  inequality  ax  >  (3  is  valid  for  conv(S). 

Proof.  For  ir  £  Zn,  the  following  disjunction  is  valid  for  conv(S'): 

n  n 

either  E  7 TfeXfc  >0  or  -  ^7 TkXk  >  1. 

k= 1  k—\ 

Let  us  repeat  the  derivation  of  (5)  with  this  disjunction  in  place  of  —Xj  >  0  or 
Xj  >  1  as  before.  We  consider  the  union  of 


Ax  >  b 
x  >  0  and 

YJl=  l  ^kXk  >  0 


Ax  >  b 
x  >  0 

-  i  T kXk  >  1. 


Using  Theorem  4  and  the  projection  theorem  (Theorem  2),  we  get  that  any 
inequality  ax  >  (3  that  satisfies 

a  ~uA  -u0(ELi  Kkek)  >  0 

a  -vA  +u0(Efc=i  ^fcefc)  >  0 

/ 3  —ub  <  0 

/ 3  — vb  — vq  <  0 

U,  Uq,  V,  Vq  >  0 


is  valid  for  conv(S').  We  can  choose  u,uo,v,vo  to  be  an  optimal  solution  of  the 
original  cut  generating  LP  (5).  This  implies  that,  for  k  =  1, . . . ,  n,  we  can  choose 
Ofc  =  max(uafc  +  uo'n’k^vak  —  Vq7 Tfc).  Smaller  coefficients  ak  produce  stronger 
inequalities  since  the  variables  are  nonnegative.  What  is  the  best  choice  oink  £  % 
to  get  a  small  afc?  It  is  obtained  by  equating  uak  +  uonk  and  vak  —  Vonk,  which 
yields  the  value  in  the  statement  of  the  theorem  (both  uo  and  vq  are  strictly 
positive  since  otherwise  ax  >  (3  is  valid  for  P ,  contradicting  that  it  is  a  cut  for 
x),  and  then  rounding  this  value  either  up  or  down  since  nk  must  be  integer. 
The  best  choice  is  the  minimum  stated  in  the  theorem.  □ 
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Bonami  and  Minoux  [18]  found  that  applying  the  Balas-Jeroslow  strength¬ 
ening  step  improves  the  average  gap  closed  by  an  additional  8  %,  as  compared 
to  the  lift-and-project  closure,  on  the  35  MIPLIB  instances  in  their  experiment. 
Specifically,  the  integrality  gap  closed  goes  from  37  %  to  45  %.  The  time  to 
perform  the  strengthening  step  is  negligible. 


3.5.  Sequential  convexification 

Theorem  7.  (Balas  [5])  P„(P„_i(. . .  P2(Pi)  •••))  =  conv(S). 

Proof.  By  induction.  Let  St  :=  {x  €  {0,1}*  x  R"_*+p  :  Ax  >  5}.  We  want  to 
show  Pt(Pt- 1(. . .  P2(Pi) . . .))  =  conv(S't).  This  is  true  for  t  =  1  by  Theorem  5 
applied  to  j  =  1,  so  consider  t  >  2.  Suppose  that  this  is  true  for  t  —  1.  By  the 
induction  hypothesis  we  have 


Pt(Pt- r(-  •  •  P2{Pi)  •••))=  Pt{c°nv{St-i)) 


and  by  Theorem  5  applied  to  j  =  t,  this  is 

=  conv(conv(5t_i)  fl  {xt  =  0})  U  (conv(5t_i)  fl  {xt  =  1}). 

For  any  set  S  that  lies  entirely  on  one  side  of  a  hyperplane  H ,  Lemma  2  states 
that 

conv(5)  fl  H  =  con ,v(S  D  H ). 

Therefore  conv(5t_i)fl{a;t  =  0}  =  conv(5t_ifl{xt  =  0})  and  conv(S}_i)n{:rt  = 
1}  =  conv(5t_i  fl  {xt  =  1}).  Thus 

Pt(Pt- 1(-  ■■P2(Pi)  ■■■))  =  conv(conv(5t_i  fl  {xt  =  0}))U(conv(S't_in{a;t  =  1})) 
=  conv((St_i  D  {xt  =  0})  U  (St- 1  n  {xt  =  1}))  =  conv(S't).  □ 


3. 6.  Rank 

Let  P1  denote  the  lift-and-project  closure  relative  to  P,  i.e. 

n 

P'  :=  fl  p,. 

3= 1 

For  k  >  2,  let  Pk  denote  the  lift-and-project  closure  relative  to  Pfc_1.  The 
set  Pk  is  a  polyhedron.  It  is  called  the  k-th  lift-and-project  closure  relative  to 
P.  A  valid  inequality  for  Pk  but  not  Pfc_1  is  said  to  be  of  rank  k.  The  lift-and- 
project  rank  of  P  is  the  smallest  integer  such  that  Pk  =  conv(S').  It  follows  from 
Theorem  7  that  the  lift-and-project  rank  of  P  is  at  most  n. 

Similarly,  N+  is  the  Lovasz-Schrijver  closure  relative  to  P,  and  the  k-th 
Lovasz-Schrijver  closure  Nk  is  defined  iteratively.  The  Lovasz-Schrijver  rank 
relative  to  P  is  the  smallest  k  such  that  TV"}  =  conv(S').  Since  N+  C  P1,  the 
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Lovasz-Schrijver  rank  is  at  most  n.  Cook  and  Dash  [24]  and  Goemans  and  Tungel 
[32]  give  examples  showing  that  the  Lovasz-Schrijver  rank  can  be  equal  to  n. 

Let  P  :=  {x  £  [0, 1]"  :  ^j=i  xj  >  5}  and  S  :=  Pfl Z".  Here  conv(5)  =  {x  £ 
[0,  1]"  :  1  xj  —  !}•  Cook  and  Dash  show  that  the  point  (2J_fc, . . . ,  2n-k) 

belongs  to  N+  for  k  <  n.  Note  that  this  point  does  not  belong  to  conv(S)  for 
k  <  n,  showing  that  N*  ^  conv(S')  for  k  <  n. 

Another  example  is  the  following:  P  :=  {x  £  R"  :  J2jGjXj  +  ~ 

Xj)  >  1  for  all  J  C  {l,...,n}}  and  S  :=  P  C\  {0,1}".  Observe  that  5  =  0. 
However,  Goemans  and  Tungel  and  independently  Cook  and  Dash  show  that 

(§>■■■,§)  eivr1- 

4.  Mixed  Integer  Inequalities 

In  this  section,  we  return  to  mixed  integer  linear  sets  5  :=  {(x,y)  £  Z"  x  : 
Ax+Gy  <  b}  where  the  variables  Xj  can  take  general  nonnegative  integer  values, 
instead  of  just  0  or  1  in  the  previous  section. 

We  present  four  families  of  valid  inequalities  for  conv(5):  Gomory  mixed 
integer  inequalities,  mixed  integer  rounding  inequalities,  split  inequalities,  and 
intersection  inequalities.  We  compare  the  resulting  elementary  closures  and  ad¬ 
dress  computational  issues. 

We  do  not  address  the  general  theory  of  superadditive  functions  [36]  in  this 
tutorial.  The  interested  reader  is  referred  to  [46]  pages  247-253  for  an  introduc¬ 
tion  to  superadditivity  applied  to  mixed  integer  linear  sets. 


4-1.  Gomory ’s  derivation 


Let  us  consider  a  mixed  integer  linear  set  comprising  a  single  equality  constraint: 

n  p 

S  :=  {(z,  y)  £  Z;xf+:  ^  ajXj  +  g^y^  =  b}. 

i= 1  i= 1 

Let  b  =  [b\  +  fo  where  0  <  /o  <  1. 

Let  aj  =  \_aj\  +  fj  where  0  <  fj  <  1.  Then 

p 

(fi -1)xj  +  '529jVj  =  k  +  f0 

fj<fo  fj>fo  i=1 


where  k  is  some  integer.  Since  k  <  —1  or  k  >  0,  any  x  £  S  satisfies  the  disjunction 


V  ffxj  -  V  ^-ri^-Xj  +  V  71 Vj  ^  1 

/o  /,>/o  /o  hfo 


1  -fj 


(7) 


OR 


53  1  -f0Xj  +  ^  1  -  Jf0Xj  5Z  1  ^f  Vj  ^  L 

fj<fo  fj>fo  -/U  J=1  -/U 


1  -fj. 


(8) 
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This  is  of  the  form  a1z  >  1  or  a2z  >  1  which  implies  JE  max(aj ,  aj)zj  >  1  for 
any  z  >  0.  For  each  variable,  what  is  the  maximum  coefficient  in  (7)  and  (8)? 
The  answer  is  easy  since  one  coefficient  is  positive  and  the  other  is  negative.  We 
get 


E 

fi<fo 


+  E 

fj>fo 


(9) 


We  have  just  proved  that  inequality  (9)  is  valid  for  S.  This  is  the  Gomory  mixed 
integer  inequality  (GMI  inequality)  [34]. 


Remark  8.  In  the  pure  integer  programming  case,  the  GMI  inequality  reduces  to 


E  T 

fj<f 0 


E  2  1 


f°  ‘  '  /^/o1_/° 


Since  when  fj  >  /o,  it  follows  that  the  GMI  inequality  dominates 


X!  fixi  >  /o. 

j= i 

which  is  known  as  the  fractional  cut.  The  fractional  cut  can  also  be  derived  using 
Chvatal’s  procedure  [21]. 


Consider  now  mixed  integer  linear  sets  with  m  inequality  constraints,  instead 
of  one  equality  constraint  above,  S  :=  {(x,y)  €  Z"  x  R+  :  Ax  +  Gy  <  &}.  Let 
P  :=  {(a :,y)  t  I"  x  lp  :  Ax  +  Gy  <  b,  x  >  0,y  >  0}  denote  the  underlying 
polyhedron.  If  P  =  0  then  S'  =  0  and  we  are  done.  Therefore  we  may  assume  that 
P^0.  Let  ax  +  ^y  <  (3  be  any  valid  inequality  for  P  (Recall  that,  by  Farkas’s 
lemma,  the  valid  inequalities  for  P  are  of  the  form  uAx  +  uGy  —  vx  —  wy  <  ub  +  t 
where  u  G  R+,  v  €  R",  w  G  M^.,  <  G  M+).  Add  a  nonnegative  slack  variable 
ax  +  'yy  +  s  =  /?,  derive  a  GMI  inequality  (9)  from  this  equation,  and  eliminate 
s  =  b  —  ax  —  yy  from  this  inequality.  The  result  is  a  valid  inequality  for  S,  in  the 
space  Rra  x  Rp  of  the  variables  x,  y.  Let  us  call  these  inequalities  GMI  inequalities 
for  S.  The  Gomory  mixed  integer  closure  is  obtained  from  P  by  adding  all  the 
GMI  inequalities  for  S. 

In  the  previous  section,  we  saw  that  one  can  optimize  in  polynomial  time  over 
the  lift-and-project  closure,  the  Sherali-Adams  closure  and  the  Lovasz-Schrijver 
closure.  A  natural  question  is  whether  the  same  is  true  of  the  Gomory  mixed 
integer  closure.  It  turns  out  that  the  situation  is  totally  different:  It  is  NP-hard 
to  optimize  a  linear  function  over  the  Gomory  mixed  integer  closure  relative  to  a 
polyhedron  P  (Caprara  and  Letchforcl  [20],  Cornuejols  and  Li  [28]).  Equivalently, 
given  a  point  (x,  y)  G  P  \  S,  it  is  NP-hard  to  find  a  GMI  cut  with  respect  to 
(x,  y )  or  show  that  none  exists.  A  similar  NP-hardness  result  was  proved  earlier 
by  Eisenbrand  [30]  for  the  Chvatal  closure  [21]. 

Note  that  this  is  in  contrast  with  the  problem  of  finding  a  GMI  cut  that  cuts 
off  a  basic  solution  (x.  y)  G  P\S.  Indeed,  any  row  of  the  simplex  tableau  where 
Xj  is  fractional  for  some  j  =  1 ,n  can  be  used  to  generate  a  GMI  cut.  We 
illustrate  this  in  the  next  section. 
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4-2.  Gomory  mixed  integer  cuts  from  the  simplex  tableau 


We  illustrate  the  application  of  GMI  cuts  on  a  small  example. 


max  x  +2  y 

—x  +y  <  2 

x  +y  <  5 

2x  —  y  <  4 


x  £  Z+,  y  £  R+. 


Acid  nonnegative  slacks  si,s2,s3  and  solve  the  LP  relaxation  max  2  where 

2  —x  —2  y  =  0 

—x  +y  +Si  =  2 

x  +y  +s2  =  5 

2x  -y  +s3  =  4. 

The  optimal  tableau  is 

2  +0.5si  +1.5s2  =  8.5 

y  +0.5si  +0.5s2  =  3.5 

x  -0.5si  +0.5s2  =  1.5 

0.5si  — 0.5s2  +s3  =  4.5 

and  the  corresponding  solution  is  x  =  1.5,  y  =  3.5.  Since  x  is  not  integer,  we 
generate  a  cut  from  the  row  in  which  x  is  basic,  namely 

x  —  0.5si  +  0.5s2  =  1.5 

Here  /o  =  0.5  and  applying  the  GMI  formula,  we  get 


si  +  s2  >  1. 
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Since  si  +  S2  =  7  —  2 y,  we  can  express  the  cut  in  the  ( x ,  y)-space: 

y  <  3. 

Adding  this  cut  and  solving  the  strengthened  LP  relaxation  gives  the  solution 
x  =  2,  y  =  3.  Since  x  is  integer  in  this  solution,  it  is  the  optimal  solution  of  the 
mixed  integer  program. 

Gomory  [34]  introduced  GMI  cuts  in  the  early  60s  but,  for  decades,  they 
were  not  used  in  commercial  integer  programming  software.  Most  textbooks 
from  the  80s  considered  them  impractical  for  various  reasons.  They  were  revived 
in  the  90s  when  Balas,  Ceria,  Cornuejols  and  Natraj  [8]  demonstrated  that  a 
branch-and-cut  algorithm  based  on  GMI  cuts  was  practical  and  superior  to  state- 
of-the-art  algorithms  of  the  time.  GMI  cuts  turn  out  to  be  surprisingly  good 
in  practice  (Bixby,  Gu,  Rothberg,  Wunderling  [14]).  On  41  MIPLIB  instances, 
adding  the  GMI  cuts  generated  from  the  optimal  simplex  tableau  reduces  the 
integrality  gap  by  24  %  on  average  [17].  Greater  improvements  can  be  obtained 
by  performing  several  rounds  of  cutting.  A  round  consists  of  solving  the  current 
linear  programming  relaxation,  generating  a  GMI  cut  for  each  basic  variable 
Xi  with  a  fractional  value  Xi  in  the  current  solution,  and  updating  the  linear 
program  by  adding  these  cuts.  Since  GMI  cuts  from  the  simplex  tableau  are 
extremely  easy  to  generate  and  effective  in  reducing  the  integrality  gap,  they  are 
widely  used  in  commercial  codes  today.  Numerical  issues  need  to  be  addressed, 
of  course:  in  order  to  avoid  numerical  instability  (and  possibly  cutting  off  the 
optimal  solution),  one  usually  discards  cuts  that  have  too  large  a  ratio  between 
the  largest  and  smallest  coefficient;  in  order  to  avoid  fill  in  of  the  basis  inverse, 
one  also  discards  cuts  that  are  too  dense. 


4-3.  Improving  mixed  integer  Gomory  cuts  by  lift-and-project 

In  this  section  we  return  to  mixed  0-1  programming  and  the  correspondence 
between  basic  feasible  solutions  of  the  cut  generating  LP  (5)  and  basic  solutions 
(possibly  infeasible)  of  the  usual  LP  relaxation  (R)  introduced  in  Section  3.3. 
Balas  and  Perregaarcl  [10]  showed  how  this  correspondence  can  be  used  to  im¬ 
prove  Gomory  mixed  integer  cuts  by  mimicking  simplex  pivots  in  (5)  through 
pivots  in  (R).  The  simplex  tableaux  of  (5)  and  (R)  will  be  referred  to  as  large 
and  small  respectively. 

Let 

Xj  =  aj0  -  ^2  ajhxh  (10) 

heJ 

be  a  row  of  the  small  optimal  simplex  tableau  such  that  0  <  ajo  <  1.  The  GMI 
cut  from  this  row  is  equivalent  to  the  strengthened  lift-and-project  cut  from 
some  basic  feasible  solution  of  (5),  where  index  j  in  (5)  is  the  same  as  in  (10). 
To  identify  this  solution,  partition  J  into  subsets  Mi  and  M2,  such  that  h  £  Mi 
if  ajh  <  0,  and  h  £  M2  if  ajh  >  0  (h  £  J  such  that  ajh  =0  can  go  into  either 
subset).  Then  eliminating  a,  (3  from  (5),  the  n  columns  indexed  by  M 1  U  M2 


Valid  Inequalities  for  MILPs 


25 


together  with  the  two  columns  indexed  by  uq  and  Vo  define  a  feasible  basis  of 
the  resulting  system  of  n  +  2  equations.  The  strengthened  lift-and-project  cut 
associated  with  this  basic  feasible  solution  to  (5)  is  equivalent  to  the  GMI  cut 
from  (10). 

To  evaluate  the  GMI  cut  generated  from  the  small  simplex  tableau  (10)  as 
a  lift-and-project  cut,  we  calculate  the  reduced  costs  in  the  large  tableau  of  the 
nonbasic  variables  of  the  above  solution  to  (5).  Each  row  X{  of  the  small  tableau 
corresponds  to  a  pair  of  columns  of  the  large  tableau,  associated  with  variables 
m  and  Vi .  The  reduced  costs  r(iq),  r(vi)  of  these  variables  in  the  large  tableau 
are  known  simple  functions  of  the  entries  and  ajh ,  for  h  £  J,  of  rows  j  and 
i  of  the  small  tableau.  If  they  are  all  nonnegative,  the  current  large  tableau  is 
optimal,  hence  the  GMI  cut  from  (10)  cannot  be  improved.  Otherwise,  the  cut 
can  be  improved  by  executing  a  pivot  in  a  row  i  of  the  small  tableau,  such  that 
r('Ui )  <  0  or  r(vi )  <  0. 

To  identify  the  nonbasic  variable  Xk  to  replace  x.j  in  the  basis  of  the  small 
tableau,  we  calculate  for  each  h  €  J  the  objective  function  value  /(a^)  of  (5) 
resulting  from  the  corresponding  exchange  in  the  large  tableau.  This  value  is  a 
known  simple  function  of  the  ratio  CLjh/o-ih  and  of  the  coefficients  of  rows  j  and 
i  of  the  small  tableau.  Any  column  h  for  which  /(a^)  <  0  is  a  candidate  for  an 
improving  pivot,  and  the  most  negative  value  indicates  the  best  column  k. 

Executing  the  pivot  in  the  small  tableau  that  exchanges  x,  for  Xk  yields  a 
new  simplex  tableau  (whose  solution  is  typically  infeasible),  whose  j- th  row  (the 
same  j  as  before!)  is  of  the  form 


xj  =  aj0  +  tai0  -  ^2  ( ajh  +  taih)xh,  (11) 

JUi\k 

with  t  :=  ajk/cLik •  The  GMI  cut  from  (11)  is  then  stronger  than  the  one  from 
(10),  in  the  sense  that  it  cuts  off  the  LP  optimum  of  (R)  by  a  larger  amount. 

These  steps  can  then  be  repeated  with  (11)  replacing  (10)  for  as  long  as 
improvements  are  possible. 

Practical  experience  shows  that  in  about  three  quarters  of  the  cases  GMI  cuts 
from  the  optimal  simplex  tableau  can  be  improved  by  the  pivoting  procedure 
described  above.  On  the  other  hand,  improvements  beyond  10  pivots  are  not 
frequent,  and  beyond  20  pivots  they  are  very  rare. 

This  procedure  was  extensively  tested  and  has  been  incorporated  into  the 
mixed  integer  module  of  XPRESS,  with  computational  results  reported  in  [50]. 


4-4-  K-cuts  and  reduce- and- split  cuts 

What  happens  if,  instead  of  generating  a  GMI  inequality  from  an  equation 
Yl'j= i  ajxj  +  £i=i  9jVj  =  we  first  multiply  the  equation  by  k  before  gen¬ 
erating  the  GMI  inequality? 

Let  us  try  it  on  an  example. 
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max  10xi  +  13x2 

10xi  +  14x2  <  43 
Xl,x2  G  Z+. 

Let  2;  =  lOxi  +  13x2  and  introduce  a  nonnegative  integer  slack  X3.  The 
optimal  tableau  is 


z  +x2  +x3  =  43 

xi  +1.4x2  +O.IX3  =  4.3 


The  GMI  cut  is  3X2  +  jy|x3  >  1,  i.e.  18x2  +  7x3  >  21. 

If  we  multiply  the  equation  xi  +  1.4x2  +  O.IX3  =  4.3  by  k  =  2,  we  get 
2xi  +  2.8x2  +  0.2x3  =  8.6.  The  resulting  GMI  cut  is  3xi  +  2x2  >  6. 

If  we  multiply  by  k  =  3,  we  get  the  GMI  cut  2x2  +  3x3  >  9. 

If  we  multiply  by  k  =  4,  we  get  2x2  +  3x3  >  4. 

If  we  multiply  by  k  =  5,  we  get  X3  >  1. 


Clearly,  these  simple  operations  on  the  equation  xi  +  1.4x2  +  O.IX3  =  4.3 
produce  very  different  GMI  cuts.  See  Figure  11. 


More  generally,  if  we  have  an  optimal  simplex  tableau,  what  sort  of  operations 
should  we  perform  on  the  rows  in  order  to  generate  useful  GMI  cuts? 

Observe  that,  if  we  multiply  ajxj  +  J+=i  Vi  =  b  by  a  factor  k  >  1, 

the  coefficients  of  the  Xj  variables  in  the  GMI  inequality  remain  between  0  and 
1,  but  the  coefficients  of  the  yj  variables  are  multiplied  by  the  factor  fc,  which 
makes  the  inequality  weaker. 
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This  suggests  the  following  idea  (Andersen,  Cornuejols  and  Li  [2]):  Take 
linear  combinations  of  the  constraints  in  order  to  produce  equations 

n  p 

Y,aix<  ■  : b 

3= 1  3  =  1 

where  the  norm  of  the  vector  g  is  small,  and  generate  GMI  inequalities  from 
these  “reduced”  equations.  Specifically,  consider  the  rows  of  the  optimal  simplex 
tableau  corresponding  to  the  basic  variables  Xi  (but  not  the  basic  y,s): 

x,  =  x,  -  V  OijXj  -  ^2  gihyh  for  i  e  B. 

jGJ  h£H 

This  gives  a  set  of  vectors  {gilieB-  Iteratively  reduce  the  norm  of  at  least  one 
vector  ~gi  in  this  set  by  integral  combinations  of  the  others  (Integrality  is  useful 
to  guarantee  that  the  basic  variables  end  up  with  integral  coefficients  when 
combining  the  rows).  The  procedure  for  reducing  the  vectors  n  is  similar 

to  Lovasz’s  basis  reduction  algorithm  [42].  It  terminates  with  a  set  of  vectors 
with  reduced  norms.  The  corresponding  combinations  of  the  tableau 

rows  are 

nzxB  =x'i-'^2  a'ijx3  -  s'ihVh  for  i  G  B. 

j£j  h£H 

The  GMI  inequalities  generated  from  the  equations  where  x\^'L  cut  off  (x,  y) 
since  only  the  nonbasic  variables  end  up  with  nonzero  coefficients  in  these  in¬ 
equalities  (indeed,  the  vectors  tt 1  are  integral).  These  cuts  are  called  reduce- and- 
split  cuts  (the  name  comes  from  the  equivalence  between  GMI  inequalities  and 
split  inequalities,  which  we  prove  in  Theorem  10).  Computational  experiments 
reported  in  Andersen,  Cornuejols  and  Li  [2]  show  that  the  reduce-and-split  cuts 
are  usually  quite  different  from  the  GMI  cuts  generated  from  the  rows  of  the 
optimal  tableau  and  that,  in  some  cases,  they  can  be  significantly  stronger.  To 
illustrate  this,  we  present  a  few  instances  of  the  MIPLIB  where  the  improvement 
was  particularly  striking.  The  gap  closed  is  reported  after  20  rounds  of  cutting. 
The  last  two  columns  give  the  number  of  nodes  in  a  branch-and-bound  algorithm 
using  these  cuts  in  the  formulation. 


Name 

GMI  gap 

R&S  gap 

GMI  nodes  R&S  nodes 

flugpl 

14% 

100% 

184 

0 

gesa2 

46% 

97% 

743 

116 

gesa2o 

92% 

98% 

9145 

75 

mod008 

47% 

88% 

1409 

82 

pp08a 

83% 

92% 

7467 

745 

rgn 

15% 

100% 

874 

0 

vpml 

44% 

98% 

7132 

1 

vpm2 

41% 

61% 

38946 

4254 

In  other  instances  the  reduce-and-split  cuts  did  not  improve  on  the  GMI  cuts 
from  the  optimal  tableau.  Therefore  it  seems  that  a  hybrid  approach  that  uses 
both  types  of  cuts  is  a  reasonable  strategy. 


28 


Gerard  Cornuejols 


The  idea  of  using  basis  reduction  to  generate  strong  GMI  cuts  has  been 
suggested  independently  by  Koppe  and  Weismantel  [39]. 

4-5.  Mixed  integer  rounding  inequalities 

Nemhauser  and  Wolsey  [46],  [47]  and  Wolsey  [57]  introduced  two  definitions  of 
mixed  integer  rounding  (MIR)  inequalities.  We  follow  Wolsey  [57]. 

Lemma  3.  Consider  the  2-variable  mixed  integer  set  S  :=  {(x,  y)  £  Z  x  M+  : 
x  —  y  <  b}.  Let  fo'=b  —  |_&J  •  Then  the  inequality 

x  -  Y~J^y  -  (12) 

is  a  valid  inequality  for  conv(S) . 

Proof.  We  show  the  validity  of  (12)  in  two  different  ways,  for  x  <  [b\  and 
x  >  [b\  +  1. 

If  x  <  L&J,  then  adding  this  inequality  to  j^~j^  times  —  y  <  0  yields  (12). 

If  x  >  [b\  + 1,  then  adding  times  — x  <  —  [5J  —  1  to  times  x  —  y<b 

yields  (12).  □ 

Note  that  the  assumption  y  >  0  is  critical  in  the  above  derivation,  whereas 
we  can  have  indifferently  x  £  Z  or  x  £  Z+. 


Fig.  12.  Illustration  of  Lemma  3 


Theorem  8.  Consider  a  mixed  integer  set  defined  by  a  single  inequality:  S  := 
{(x,  y)  €  Z”  x  :  ax  +  gy  <  bj .  Let  fo  :=  b  —  [5J  and  fj  :=  aj  —  [a.j\  .  Then 
the  inequality 

)xj  +  Y~T  giyi  -  L6J 

j= 1  °  j-9j<  0 


(13) 
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is  a  valid  inequality  for  conv(S) . 

Proof.  Relax  ax  +  gy  <  b  to 

E  1®^+  E  aixi  +  E  9jVj<b. 

T-fj<fo  j'-fj  >fo  T-9j< o 

The  validity  of  the  relaxation  follows  from  x  >  0  and  y  >  0.  Let 

w:=  E  lai\xi  +  E  \ai\xi  and 

rfj<fo  3:fj  >fo 

Z  :=  -  E  9iVi  +  E  t1  ~  fi)xi- 

3:9j  <0  j’fj  >/o 

We  have  w  —  z  <  b  and  since  w  £  Z  and  z  £  K+,  we  can  apply  Lemma  3.  Thus 

Subtituting  w  and  z  yields  (13).  □ 

Wolsey  [57]  calls  the  inequality  (13)  MIR  inequality. 

Lemma  4.  Consider  a  mixed  integer  set  defined  by  a  single  inequality:  S  := 
{( x,y )  €  Z"  x  :  ax  +  gy  <  6}.  The  MIR  inequality  (13)  is  identical  to  the 
GMI  inequality. 


Proof.  The  Gomory  mixed  integer  inequality  is  obtained  by  adding  a  slack  vari¬ 
able  ax  +  gy  +  s  =  b,  generating  (9)  in  the  (. x ,  y ,  s)-space: 


E 

fj<fo 


u 


\ '  1  ~  fj , 

1  —  /o 
fj>fo  J U 


E 

9j>  0 


Vi 


\ '  9j 

^  1-fo 

9j<  0  JU 


Vi 


j  T  —s  ^  1 


fo 


and  substituting  s  =  b  —  ax  —  gy  in  this  inequality  to  get  the  GMI  inequality 
in  the  (x,  y)-space.  It  is  straightforward  to  check  that  the  resulting  inequality  is 
identical  to  (13).  □ 


Next  we  return  to  the  general  mixed  integer  linear  set  S  :=  {(x,  y)  £  Z"  x 
R(j_  :  Ax  +  Gy  <  5} .  The  proof  of  the  previous  lemma  can  be  slightly  modified 
to  show  the  next  result. 


Lemma  5.  Consider  a  mixed  integer  set  with  m  constraints  S  :=  {(x,y)  € 
Z”  x  :  Ax  +  Gy  <  b} .  Let  s  :=  5  —  Ax  —  Gy  be  a  nonnegative  vector  of  slack 
variables.  For  any  A  €  Rm,  let  a  :=  Axl,  g  :=  A G,  (3  :=  A b,  fj  :=  aj  —  |_a.y J  and 
fo  :=  (3  —  L/3J  •  The  GMI  inequality  generated  from  \Ax  +  A  Gy  +  As  =  Xb  is 

E(L«iJ  +  — 7 —)xi  +  -r^—r  E  9iyi  +  T~T  E  A*Si  -  L/?J  (14) 

i^i  1~h  1  '  •/il  Zlo 
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Any  valid  inequality  for  P  :=  {(x,  y)  G  M"  xR^  :  Ax+Gy  <b,  x  >  0,  y  >  0} 
can  be  used  to  generate  a  MIR  inequality  (13)  valid  for  S.  Define  the  MIR 
closure  relative  to  P  :=  {(x,  y)  G  K"  x  R+  :  Ax  +  Gy  <  b}  as  the  set  obtained 
from  P  by  adding  all  these  MIR  inequalities.  Lemma  4  implies  that  the  MIR 
closure  contains  the  GMI  closure.  This  inclusion  can  be  strict  as  observed  by 
Bonami  and  Cornuejols  [16],  and  Dash,  Giinluk  and  Lodi  [29]:  For  the  integer 
set  S  :=  {(xi,  X2)  G  Z^  :  2xi  +  X2  <  2,  —  2xi  +  X2  <  0},  the  inequality  X2  <  0  is 
a  GMI  inequality  (to  see  this,  use  Ai  =  \  and  A2  =  —  \)  but  one  can  show  that 
it  is  not  a  MIR  inequality.  Thus,  in  general,  inequalities  (14)  can  be  stronger 
than  (13).  This  is  because  the  vector  A  in  Lemma  5  can  have  both  positive 
and  negative  components,  whereas  (13)  corresponds  to  the  special  case  of  (14) 
where  only  nonnegative  multipliers  A j  are  used.  With  the  other  definition  of  MIR 
inequalities  (Nemhauser  and  Wolsey  [47]),  one  can  show  that  the  MIR  closure 
is  identical  to  the  GMI  closure  [47],  [26]. 

Marchand  and  Wolsey  [44]  showed  that  several  classical  inequalities  in  integer 
programming,  such  as  flow  cover  inequalities,  are  MIR  inequalities.  They  imple¬ 
mented  an  aggregation  heuristic  to  generate  MIR  cuts  with  excellent  computa¬ 
tional  results  in  the  resolution  of  mixed  integer  linear  programs.  For  example,  on 
41  MIPLIB  instances,  the  Marchand- Wolsey  aggregation  heuristic  (as  available 
in  the  COIN-OR  repository)  reduces  the  integrality  gap  by  23%  on  average  [17]. 


4-6.  Split  inequalities 

Let  P  :=  {(x,  y)  G  Rn  x  :  Ax  +  Gy  <  b}  where  A,  G,  b  have  rational  entries, 
and  let  S  :=  P  D  (Z”  x  Rp).  For  n  G  Z”  and  ttq  G  Z,  define 

II1  :=  P  <1  {(x,  y)  :  nx  <  tt0} 
n2:=Pn  {(x,  y )  :  7 rx  >  7r0  +  1} 

Clearly  S  C  III  U  772-  Therefore  any  inequality  cx  +  hy  <  cq  that  is  valid  for 
III  U II2  is  also  valid  for  S.  An  inequality  cx  +  hy  <  Cq  is  called  a  split  inequality 
[25]  if  there  exists  (7r,  7t0)  G  Z”  x  Z  such  that  cx+hy  <  Co  is  valid  for  77i  Ui72  (see 
Figure  13).  The  disjunction  irx  <  no  or  7rx  >  no  +  1  is  called  a  split  disjunction. 

Many  of  the  inequalities  that  we  have  studied  in  this  tutorial  are  split  in¬ 
equalities.  Lift-and-project  inequalities  (i.e.  the  valid  inequalities  for  fl”=1  Pj)  are 
split  inequalities  as  a  consequence  of  Theorem  5  (the  split  disjunction  is  Xj  <  0 
or  Xj  >  1).  Strengthened  lift-and-project  cuts  are  also  split  inequalities  (recall 
the  split  disjunction  ^fe=i  n kxk  >  0  or  —  1  K kXk  >  1  used  in  the  proof  of 

Theorem  6).  GMI  inequalities  are  split  inequalities  (indeed  the  disjunction  (7) 
or  (8)  is  a  split  disjunction).  Axcuts  and  reduce-and-split  cuts  are  split  inequal¬ 
ities  since  they  are  special  types  of  GMI  cuts.  MIR  inequalities  are  also  split 
inequalities  (remember  the  derivation  in  the  proof  of  Lemma  3). 

The  intersection  of  all  split  inequalities,  denoted  by  P1,  is  called  the  split 
closure  relative  to  P. 
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Fig.  13.  A  split  inequality 


Theorem  9.  (Cook,  Kannan  and  Schrijver  [25])  If  P  is  a  rational  polyhedron, 
the  split  closure  relative  to  P  is  a  rational  polyhedron. 

For  k  >  2,  Pk  denotes  the  split  closure  relative  to  Pfc_1  and  it  is  called  the 
k-th  split  closure  relative  to  P.  It  follows  from  the  above  theorem  that  Pk  is  a 
polyhedron.  Unlike  for  the  pure  integer  case  [21],  [52],  there  is  in  general  no  finite 
r  such  that  Pr  =  conv(5')  in  the  mixed  integer  case,  as  shown  by  the  following 
example  [25]. 

Example  1.  Let  S  :=  {(x,y)  €  Z+  x  R+  :  X\  >  y,  X2  >  y ,  X\  +  X2  +  2 y  <  2}. 
Starting  from  P  :=  {(x\,  X2,  y)  G  R+  :  x\  >  y,  X2  >  y,  X\  +  X2  +  2 y  <  2},  we 
claim  that  there  is  no  finite  r  such  that  Pr  =  conv(S'). 

To  see  this,  note  that  P  is  a  simplex  with  vertices  O  =  (0,  0, 0),  A  =  (2,  0, 0), 
B  =  (0, 2,  0)  and  C  =  (^,  (see  Figurel4).  S  is  contained  in  the  plane  y  =  0. 
More  generally,  consider  a  simplex  P  with  vertices  O,  A,  B  and  C  =  (|,  i,  t)  with 
t  >  0.  Let  Ci  =  C,  let  C2  be  the  point  on  the  edge  from  C  to  A  with  coordinate 
Xi  =  1  and  C3  the  point  on  the  edge  from  C  to  B  with  coordinate  X2  =  1. 
Observe  that  no  split  disjunction  removes  all  three  points  Ci,  C2,  C3.  Let  Qi  be 
the  intersection  of  all  split  inequalities  that  do  not  cut  off  Ci.  All  split  inequalities 
belong  to  at  least  one  of  these  three  sets,  thus  P1  =  Q\  n  Q2  n  Q3.  Let  be  the 
simplex  with  vertices  O,  A,  B,Ct.  Clearly,  Si  C  Qi.  Thus  S\  Pi  S2  Ci  S3  C  P1 .  It 
is  easy  to  verify  that  (|,  |,  |)  €  S,  for  i  =  1,2  and  3.  Thus  (|,  |)  G  P1.  By 

induction,  (5,3,^)  G  Pk ■ 


Remark  9.  For  mixed  0,1  programs,  Theorem  7  implies  that  P"  =  conv(S')  (In¬ 
deed,  the  lift-and-project  polytope  Pi  contains  the  split  closure  relative  to  P  by 
Theorem  5.  Similiarly,  P2(Pi))  contains  the  2nd  split  closure,  etc). 

Example  2.  Cornuejols  and  Li  [27]  observed  that  the  n-th  split  closure  is  needed 
for  0,1  programs,  i.e.  there  are  examples  where  Pk  ^  conv(«S)  for  all  k  <  n.  They 
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Fig.  14.  Example  showing  that  the  split  rank  can  be  unbounded 

use  the  following  well-known  polytope  studied  by  Chvatal,  Cook,  and  Hartmann 
[22]: 

Pcch  ■■=  {x  G  [0,1]"  :  +X](1  -  xo)  -  for  a11  J  -  I1:  V  ■■)«}} 

j&J  j?J 

Let  Fj  be  the  set  of  all  vectors  x  G  Rn  such  that  j  components  of  x  are  \ 
and  each  of  the  remaining  n  —  j  components  are  equal  to  0  or  1.  The  poly  tope 
Pcch  is  the  convex  hull  of  F\. 

Lemma  6 .  If  a  polyhedron  P  C  Rra  contains  Fj,  then  its  split  closure  P 1  con¬ 
tains  Fj+ 1. 

Proof.  It  suffices  to  show  that,  for  every  (n,  no)  G  Zn  x  Z,  the  polyhedron  Ft  = 
conv((Pfl{x  :  nx  <  7r0})U(Pn{:r  :  nx  >  7r0  +  l}))  contains  Pj+i-  Let  v  G  Pj+i 
and  assume  w.l.o.g.  that  the  first  j  +  1  elements  of  v  are  equal  to  If  nv  G  Z, 
then  clearly  v  G  17.  If  nv  £  Z,  then  at  least  one  of  the  first  j  +  1  components  of 
n  is  nonzero.  Assume  w.l.o.g.  that  n\  >  0.  Let  Vi,V2  G  Fj  be  equal  to  v  except 
for  the  first  component  which  is  0  and  1  respectively.  Notice  that  v  =  1,1  jp'2 . 
Clearly,  each  of  the  intervals  [nv\,nv\  and  [nv,nv2\  contains  an  integer.  Since 
nx  is  a  continuous  function,  there  are  points  V\  on  the  line  segment  conv(u,iq) 
and  V2  on  the  line  segment  conv(v,t>2)  with  nv\  G  Z  and  nV2  G  Z.  This  means 
that  t>i  and  V2  are  in  77.  Since  v  G  conv(tq,  V2),  this  implies  v  G  77.  □ 

Starting  from  P  =  Pcch  and  applying  the  lemma  recursively,  it  follows  that 
the  (n  —  l)-st  split  closure  relative  to  Pcch  contains  Fn,  which  is  nonempty. 
Since  cony  (Pcch  (~l  {0,  l}n)  is  empty,  the  n-th  split  closure  is  needed  to  obtain 
con y(PccH  H  {0, 1}").  End  of  Example  2. 

Remark  1 0.  In  view  of  Example  1  showing  that  no  bound  may  exist  on  the  split 
rank  when  the  integer  variables  are  general,  and  Remark  9  showing  that  the  rank 
is  always  bounded  when  they  are  0,1  valued,  one  is  tempted  to  convert  general 
integer  variables  into  0,1  variables.  For  a  bounded  integer  variable  0  <  x  <  u, 
there  are  several  natural  tranformations: 

(i)  a  binary  expansion  of  x  (see  Owen  and  Mehrotra  [49]); 
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(ii)  x  =  J2zi  <  1)  zi  G  {0,1}  (see  Sherali  and  Adams  [55]  and 

Koppe,  Louveaux  and  Weismantel  [38]); 

(iii)  x  =  Yh=  i  zu  zi  <  z»-i,  zi  €  {0, 1}  (see  Roy  [51]). 

Roy  [51]  also  shows  that  practical  benefits  can  be  gained  from  such  a  transfor¬ 
mation. 


4-7.  Chvatal  inequalities 

Let  P  :=  {(x,y)  G  R+  x  :  Ax  +  Gy  <  b}  be  a  rational  polyhedron  and  let 
S  :=  {( x,y )  G  Z"  x  R+  :  Ax  +  Gy  <  b}.  Define  a  Chvatal  inequality  to  be  an 
inequality  nx  <  ttq  with  (7 r,  no)  G  Z”  x  Z,  where 

Pn[(l,J/)  :  7TCC  >  7To  +  1}  =  0. 

It  follows  from  the  definition  that  Chvatal  inequalities  are  split  inequalities  where 
one  of  the  sets  I7i  or  II 2  is  empty. 

Lemma  7.  Chvatal  inequalities  are  GMI  inequalities. 

Proof.  The  lemma  holds  when  P  =  0.  Assume  now  that  0.  Let 

(3  =  max  nx 

(x,  y )  G  P. 

Then  nx  <  (3  is  a  valid  inequality  for  P.  Since  Pn{(i,t/):  nx  >  7r0  +  1}  =  0, 
it  follows  that  /3  <  no  +  1.  Applying  the  MIR  formula  (13)  to  nx  <  (3,  it  follows 
that  the  inequality  nx  <  7To  is  a  MIR  inequality.  By  Lemma  4,  this  inequality  is 
a  GMI  inequality.  □ 

Define  the  Chvatal  closure  relative  to  P  as  the  intersection  of  P  with  all 
the  Chvatal  inequalities.  How  does  the  Chvatal  closure  compare  to  the  lift-and- 
project  closure?  The  answer  is  that  neither  is  included  in  the  other  in  general, 
as  shown  by  the  following  2-variable  examples:  For  P  :=  {x  £  :  x2  < 

2x\,  2x\  +  x2  <  2}  and  5  :=  P  D  {0,  l}2,  the  inequality  x2  <  0  is  a  lift- 
and-project  inequality  but  not  a  Chvatal  inequality.  On  the  other  hand,  for 
P  :=  {x  G  R+  :  Xi  +  x2  <  1.5,  x\  <  1,  x2  <  1}  and  S  :=  P  fl  {0,  l}2, 
the  inequality  x\  +  x2  <  1  is  a  Chvatal  inequality  but  not  a  lift-and-project 
inequality. 

Although  it  is  NP-hard  to  optimize  over  the  Chvatal  closure,  there  are  empir¬ 
ical  results  on  its  strength.  Bonami,  Cornuejols,  Dash,  Fischetti,  Lodi  [17]  found 
that  the  Chvatal  closure  closes  at  least  29  %  of  the  integrality  gap  on  average  on 
41  MIPLIB  instances  (all  the  MIPLIB  3  instances  that  have  at  least  one  contin¬ 
uous  variable  and  nonzero  integrality  gap) .  For  the  remaining  24  instances  (pure 
integer  programs  in  MIPLIB  3  with  nonzero  integrality  gap),  Fischetti  and  Lodi 
[31]  found  that  the  Chvatal  closure  closes  at  least  63  %  of  the  integrality  gap  on 
average. 


34 


Gerard  Cornuejols 


4-8.  Equivalence  between  split  inequalities  and  GMI  inequalities 

Nemhauser  and  Wolsey  [46],  [47]  studied  the  relation  between  split  closure  and 
GMI  closure.  The  equivalence  proof  given  here  is  based  on  [27].  It  uses  Lemma 

1. 

Theorem  10.  Let  P  :=  {( x,y )  €  R"  x  R+  :  Ax  +  Gy  <  b}  be  a  rational 
polyhedron  and  let  S  :=  PC l(Z”  x  Rp).  The  split  closure  relative  to  P  is  identical 
to  the  Gomory  mixed  integer  closure  relative  to  P. 

Proof.  We  may  assume  that  the  constraints  x  >  0  and  y  >  0  are  part  of  Ax  + 
Gy  <  b  in  the  description  of  P. 

Consider  first  a  GMI  inequality.  Its  derivation  in  Section  4.1  was  obtained  by 
arguing  that  k  =  \  b\  —  <f0  [aj\xj  —  S/j>/0  \aj~\xj  an  integer,  and  either 

k  <  —  1  or  k  >  0.  This  is  a  disjunction  of  the  form  nx  <  no  or  nx  >  ttq  +  1  with 
(7T,  7To)  £  Z"  x  Z.  Thus  the  derivation  of  the  GMI  inequality  implies  that  it  is  a 
split  inequality. 


Conversely,  let  cx+hy  <  cq  be  a  split  inequality.  Let  nx  <  no  or  nx  >  no  + 1 
denote  the  split  disjunction  used  in  deriving  this  inequality,  where  ( n ,  no)  € 
Z”  x  Z,  and  let  IJi,  77 2  be  the  corresponding  intersections  with  P . 

First  assume  that  II2  =  0.  Then  the  inequality  cx  +  hy  <  cq  is  valid  for 
III.  Since  all  the  inequalities  that  define  77i  are  valid  for  P  except  possibly  for 
nx  <  no,  it  suffices  to  show  that  nx  <  no  is  a  GMI  inequality.  This  follows  from 
the  fact  that  nx  <  no  is  a  Chvatal  inequality  (since  II2  :=  P  D  {(x,  y)  :  nx  > 
tto  +  1}  =  0)  and  from  Lemma  7. 

A  similar  argument  holds  when  77i  =  0,  so  we  now  assume  that  77i  7^  0  and 
772  yf  0. 

By  Lemma  1,  there  exist  a,  (3  £  R+  such  that 

cx  +  hy  —  a{nx  —  no)  <  cq  and  (15) 

cx  +  hy  +  (3{nx  —  (7ro  +  1))  <  Co  (16) 

are  both  valid  for  P.  We  can  assume  a  >  0  and  (3  >  0  since,  otherwise,  cx  +  hy  < 
Co  is  valid  for  P  and  therefore  also  for  its  Gomory  mixed  integer  closure.  We  now 
apply  the  Gomory  mixed  integer  procedure  to  (15)  and  (16).  Introduce  slack 
variables  si  and  s2  in  (15)  and  (16)  respectively  and  subtract  (15)  from  (16). 

(a  +  (3)nx  +  s2  —  si  =  (a  +  (3)  7r0  +  (3. 

Dividing  by  a  +  (3  we  get 


52 _ Si 

oc  +  (3  oc  +  (3 


P 

ex  +  (3 


nx  + 


=  ttq  + 


Valid  Inequalities  for  MILPs 


35 


From  this  equation,  we  can  derive  a  GMI  inequality.  Note  that  fo  =  and 
that  the  continuous  variable  S2  has  a  positive  coefficient  while  si  has  a  negative 
coefficient.  So  the  GMI  inequality  is 


1 

a+/3 

<3 

a+/3 


S2  + 


1 

Q-+/3 


1_  _A_ 

1  a+/3 


Si  >  1 


which  simplifies  to 

11, 

-Sl  +  7jS‘2  >  1. 

a  p 

We  now  replace  si  and  S2  as  defined  by  the  equations  (15)  and  (16)  to  get  the 
GMI  inequality  in  the  space  of  the  x,  y  variables.  The  resulting  inequality  is 


cx  +  hy  <  cq  . 


Therefore  cx  +  hy  <  cq  is  a  GMI  inequality. 


□ 


The  split  closure  is  identical  to  the  GMI  closure.  How  tight  is  it  in  practice? 
Balas  and  Saxena  [11]  addressed  this  question  by  formulating  the  separation 
problem  for  the  split  closure  as  a  parametric  mixed  integer  linear  program  with 
a  single  parameter  in  the  objective  function  and  the  right  hand  side.  They  found 
that  the  split  closure  closes  82  %  of  the  integrality  gap  on  average  for  33  MI- 
PLIB  instances  of  mixed  integer  programs,  and  71  %  on  average  for  24  MIPLIB 
instances  of  pure  integer  programs.  This  experiment  shows  that  the  split  closure 
is  surprisingly  strong.  It  is  interesting  to  compare  the  above  82  %  figure  with 
the  48  %  obtained  by  adding  GMI  cuts  from  the  optimal  tableau  +  MIR  cuts 
+  lift-and-project  cuts  [17].  This  is  a  clear  indication  that  there  are  useful  split 
cuts  that  are  currently  not  exploited  in  integer  programming  codes.  Finding  deep 
split  cuts  efficiently  remains  a  challenging  practical  issue. 


4-9.  Intersection  cuts 

Intersection  cuts  were  introduced  by  Balas  [4].  They  are  obtained  from  a  basis 
of  the  linear  programming  relaxation  and  a  convex  set  that  contains  the  cor¬ 
responding  basic  solution  but  no  integer  feasible  solution  in  its  interior.  In  this 
section,  we  focus  on  convex  sets  derived  from  split  disjunctions.  For  convenience, 
we  assume  that  the  constraints  are  in  equality  form.  Let  S  :=  P  fl  (Zn  x  Rp) 
with 

Pxfielf:  Ax  =  b} 

where  A  £  Rmx(n+p)  and  b  £  Rm.  Wlog  assume  that  A  is  of  full  row  rank.  Let 
B  index  m  linearly  independent  columns  of  A  {B  is  a  basis )  and  let  J  index  the 
remaining  columns  of  A  (the  nonbasic  variables).  Let  P(B)  be  the  relaxation  of 
P  obtained  by  deleting  the  nonnegativity  constraints  on  the  basic  variables: 

P{B )  :=  {x  £  Rn+P  :  Ax  =  b  and  Xj  >  0  for  j  e  J}. 


(17) 


36 


Gerard  Cornuejols 


The  convex  hull  of  P(B)  fl  (Zra  x  Rp)  was  studied  by  Gomory  [35]  under  the 
name  of  corner  polyhedron.  Clearly,  S  is  contained  in  the  corner  polyhedron 
and,  therefore,  valid  inequalities  for  the  corner  polyhedron  are  also  valid  for  S. 

We  may  write  P(B)  =  x  +  C,  where  C  is  the  polyhedral  cone  C  :=  {x  £ 
R"+p  .  ax  _  q  anc[  x.  >  q  for  j  g  jj5  and  x  is  basic  solution  that  solves 
the  system  Ax  =  b  and  Xj  =  0  for  j  £  J: 

Xi  =  Xi  —  ^  dijXj,  i  £  B.  (18) 

jeJ 

The  extreme  rays  of  C  are  the  following  vectors  r°  for  j  £  J: 

—dkj  if  k  £  B , 

1  if  k  =  j,  (19) 

0  if  k  £  J\{j}- 

Thus  P{B)  =  x  +  cone({rJ}JgJ),  where  cone({rJ}jSj)  denotes  the  polyhedral 

cone  generated  by  the  vectors  {rJ}jeJ. 

We  now  derive  the  intersection  cut.  Consider  an  arbitrary  split  disjunction 
ttx  <  ttq  or  ttx  >  7To  + 1,  where  tt  £  Zn  x  {0}p  and  no  £  Z.  Assume  x  violates  the 
split  disjunction,  and  define  e  :=  nx  —  ttq  to  be  the  amount  by  which  x  violates 
the  first  term  of  the  disjunction.  Since  no  <  nx  <  no  +  1,  we  have  0  <  e  <  1. 
Also,  for  j  £  J,  define  scalars: 

(20) 


+  1 


Fig.  15.  Intersection  cut 


f  -nfn  if  nr\  <  0, 

a3  =  {  if 

I  +oo  otherwise. 


The  interpretation  of  ay  is  the  following.  Consider  the  half-line  x  +  arJ , 
where  a  £  R+ ,  starting  from  x  in  the  direction  r° .  The  value  ay  is  the  smallest 
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a  G  R+  such  that  x  +  ar3  satisfies  the  split  disjunction.  In  other  words,  the  point 
x  +  ajTJ  is  the  intersection  of  the  half-line  starting  in  x  in  direction  r3  with  the 
hyperplane  ttx  =  no  or  the  hyperplane  nx  =  no  +  1  (see  Figure  15).  Note  that 
ay  =  +oo  when  the  direction  r3  is  parallel  to  the  hyperplane  nx  =  no-  Given 
the  numbers  ay  for  j  G  J,  the  intersection  cut  associated  with  B  and  the  split 
disjunction  nx  <  no  or  nx  >  no  +  1,  is  given  by: 


—  >  1-  (21) 

CXj 

This  inequality  is  valid  for  the  corner  polyhedron  conv(P(l?)  (~l  (Zn  x  Rp)) 
since  it  is  a  split  inequality.  In  fact,  the  intersection  cut  gives  a  complete  de¬ 
scription  of  the  set  of  points  in  P(B)  that  satisfy  the  split  disjunction: 

con v(P(P)  fl  ({x  :  nx  <  7To}  U  {x  :  nx  >  no  +  1}))  =  P(B)  fl  {x 

T  OLi 
jeJ  J 

Andersen,  Cornuejols  and  Li  [1]  showed  that  intersection  cuts  are  sufficient  to 
describe  the  split  closure  Pl  relative  to  P.  Let  B  denote  the  set  of  all  bases  of 
A.  We  have: 

Theorem  11. 

Pl  =  n  n  conv(P(B)  n  ({x  :  nx  <  no}  U  {x  :  nx  >  no  +  1})). 

BeB  igZ"x{0}P,  7r06Z 

The  following  lemma  shows  that  the  GMI  cuts  derived  from  rows  of  the 
simplex  tableau  are  intersection  cuts: 


Lemma  8.  Let  B  be  a  basis  of  A,  and  let  x  be  the  corresponding  basic  solution. 
Also,  let  Xi  be  a  basic  integer  constrained  variable,  and  suppose  ay  is  fractional. 
The  GMI  cut  obtained  from  the  row  of  the  simplex  tableau  in  which  Xi  is  basic  is 
given  by  the  inequality  j  7T  —  where  ctj  is  computed  using  formula  (20) 
with  no  :=  [xjj ,  and  for  j  =  1, ...  ,n: 


n  j  ■■= 


[dij\  if  j  G  J  and  fj  <  f0, 
\dij 1  if  j  G  J  and  fj  >  f0, 

1  ifj  =  i, 

0  otherwise. 


(22) 


Proof.  Let  us  compute  ay  for  the  above  disjunction  using  formula  (20),  where 
j  G  J.  We  have: 


e  =  nx  -  n0  =  Xi  -  \_Xi\  =  f0. 
Using  (19)  and  (22),  we  get 


777 
7T  rJ  =  TTiTi  —  7TjrJj 


— f  j  if  1  <  j  <  n  and  /,•  <  f0, 
1  ~  fj  if  1  <  j  <  n  and  /y  >  /0, 
—dij  if  n  +  1  <  j  <  n  +  p. 


(23) 


Now  ay  follows  from  formula  (20).  This  yields  the  GMI  cut  as  claimed.  □ 
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5.  Conclusion 


Fig.  16.  Relations  between  families  of  cuts 


This  tutorial  considered  mixed  integer  linear  sets  S  :=  P  ft  (Z™  x  RS_)  where 
P  :=  {(x,y)  £  R”  x  R+  :  Ax  +  Gy  <  b}  is  a  polyhedron.  It  presented  several 
families  of  valid  inequalities  for  S  and  compared  the  corresponding  elementary 
closures.  See  Figure  16.  The  Gomory  mixed  integer  (GMI)  closure  is  identical  to 
the  split  closure.  This  set  can  also  be  obtained  using  intersection  cuts  generated 
from  all  the  basic  solutions  (both  feasible  and  infeasible) .  Computational  exper¬ 
iments  show  that  the  split  closure  is  surprisingly  tight  (it  closes  over  75  %  of  the 
integrality  gap  on  average  on  MIPLIB  3  instances).  Unfortunately  separating  or 
optimizing  over  the  split  closure  is  NP-hard.  This  justifies  our  interest  in  families 
of  split  cuts  that  can  be  generated  efficiently:  GMI  cuts  from  the  optimal  basis, 
MIR  cuts  obtained  heuristically,  reduce-and-split  cuts  and  lift-and-project  cuts. 
They  appear  in  the  bottom  of  the  figure.  On  the  right  hand  side  of  Figure  16 
appear  elementary  closures  that  are  defined  for  mixed  0,1  programs  only,  from 
weakest  at  the  bottom  to  tightest  at  the  top. 

Integer  programming  solvers  went  from  using  no  general  cutting  planes  twenty 
years  ago  to  using  many  rounds  of  such  cuts  nowadays.  Much  larger  instances 
can  be  solved,  but  this  aggressive  cutting  may  sometimes  result  in  the  optimum 
solution  being  cut  off  due  to  numerical  difficulties  with  cuts  from  later  rounds. 
Have  we  reached  the  limits  of  what  can  be  acheived  with  general  cuts?  I  do  not 
think  so.  The  split  closure  is  surprisingly  tight  and  integer  programming  solvers 
should  probably  do  a  better  job  of  approximating  it  before  generating  cuts  of 
higher  rank. 

We  conclude  with  three  possible  research  directions.  The  previous  paragraph 
suggests  that  one  should  look  for  new  classes  of  deep  split  cuts  that  can  be  sepa¬ 
rated  rapidly,  or  just  use  the  currently  known  families  more  efficiently.  Variable 
transformations  are  intriguing  (see  Remark  10)  and  deserve  more  attention  in  the 
context  of  cutting  planes.  Obtaining  compact  higher  dimensional  formulations 
by  applying  Theorem  4  is  another  promising  direction  for  research. 
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